The ControlWorkflows
repository hosts the configuration tree for AliECS workflow templates (WFT) and task templates (TT).
- FLP Suite Workflow Configuration
This is mostly relevant to coconut
users and users who edit or produce their own WFTs and TTs.
All AliECS variables are strings. Some workflow templates can choose to treat a string variable as a JSON payload, with built in functions such as ToJson
/FromJson
, but they are always provided as strings by the user, and stored as strings in the key-value map.
Boolean values are also stored as strings, i.e. "true"
and "false"
. This is because they can be read in as template expressions, i.e. strings, which are then processed into boolean values during workflow loading.
All variables except hosts
are optional.
Variable | Description | Example | Default |
---|---|---|---|
hosts |
JSON-formatted list of hosts to control the scale of certain workflows | ["myhost1","myhost2"] |
[] |
log_task_output |
Forward task output to InfoLogger and stdout, stdout only, or nowhere | all , stdout or none |
depends on WFT, usually none |
readout_cfg_uri |
URI of a Readout configuration payload | consul-ini://{{ consul_endpoint }}/o2/components/readout/ANY/any/readout-standalone-{{ task_hostname }} |
depends on WFT |
user |
Name of the Linux user that should run all tasks | root |
flp |
All variables are optional.
Variable | Description | Default |
---|---|---|
roc_ctp_emulator_enabled |
If true, o2-roc-ctp-emulator will run during START to get data from CRU without LTU |
false |
roc_trigger_mode |
Trigger mode for o2-roc-ctp-emulator (only if roc_ctp_emulator_enabled is true ) |
continuous |
roc_ctp_emulator_endpoints |
List of CRU endpoints to emulate trigger (only if roc_ctp_emulator_enabled is true ) |
["#0"] |
detector |
Detector name string, used for dataspec strings and other task parameters | TEST |
dd_enabled |
If true, Data Distribution components will run and process data produced by Readout | true |
qcdd_enabled |
If true, QualityControl components will run and process data forwarded by StfBuilder |
false |
stfb_standalone |
If true, StfBuilder runs with --stand-alone and StfSender is disabled (only if dd_enabled is true ) |
false |
dd_discovery_net_if |
The name of the InfiniBand interface for StfSender output (only if dd_enabled is true ) |
lo |
dd_discovery_endpoint |
URI of the Data Distribution Consul instance (only if dd_enabled is true ) |
no-op:// |
roc_cleanup_enabled |
Run o2-roc-cleanup after environment shutdown |
true |
fmq_cleanup_enabled |
Run fairmq-shmmonitor -c after environment shutdown |
true |
fmq_severity |
Severity level for FairMQ (including Data Distribution) stdout messages |
info |
fmq_verbosity |
Verbosity level for FairMQ (including Data Distribution) stdout messages |
high |
rdh_version |
RDH version for StfBuilder (only if dd_enabled is true ) |
no-op:// |
The workflow requires a string json array of the hosts (e.g ["flp1","flp1"]
) when it is executed. It will retrieve all the cru cards and their endpoints.
The default behaviour, if no variables are passed, is to apply the configuration from the o2/components/readoutcard/<host>/cru/<card>/<endpoint>
to all cards and endpoints. All the parameters along with the default values are provided on the following tables. Examples on how to execute the workflow with coconut can be found here.
All variables are optional.
Variable | Description | Default |
---|---|---|
roc_config_uri_enabled |
If true, o2-roc-config will run using the --config-uri flag and the config found at the consul service under the {{ consul_prefix }}/{{ host }}/cru/{{ card }}/ |
"true" |
consul_port |
The port that consul service is running | 8500 |
consul_prefix |
The consul prefix where the cru configuration is stored | o2/components/readoutcard |
cards |
A string json array with all the cards in the host, provided by CRUCardsForHost(hostname) (e.g ["1251","1276"] ) |
{{CRUCardsForHost( hostname )}} |
card_endpoints |
A string with card endpoints separated by spaces, provided by {{EndpointsForCRUCard( host, card )}} (e.g 0 1 ) |
{{EndpointsForCRUCard( host, card )}} |
This will be executed when roc_config_uri_enabled
is disabled ("roc_config_uri_enabled":"false"
).
All variables are optional.
Variable | Description | Default |
---|---|---|
rocConfigLinks |
Value to be passed to the --links flag (e.g 1,2,3 or 0-11 ) |
0 |
rocConfigAllowRejection |
If true, the flag --allow-rejection will be set to the o2-roc-config command |
"false" |
rocConfigClock |
Value to be passed to the --clock flag |
LOCAL |
rocConfigCruId |
Value to be passed to the --cru-id flag |
0x0 |
rocConfigCrocId |
Value to be passed to the --crorc-id flag |
0x0 |
rocConfigDatapathMode |
Value to be passed to the --datapathmode flag |
PACKET |
rocConfigDownstreamData |
Value to be passed to the --downstreamdata flag |
CTP |
rocConfigGbtMode |
Value to be passed to the --gbtmode flag |
GBT |
rocConfigLoopback |
If true, the flag --loopback will be set to the o2-roc-config command |
"false" |
rocConfigPonUpstream |
If true, the flag --pon-upstream will be set to the o2-roc-config command |
"false" |
rocConfigDynOffset |
If true, the flag --dyn-offset will be set to the o2-roc-config command |
"false" |
rocConfigOnuAddress |
Value to be passed to the --onu-address flag |
0 |
rocConfigForceConfig |
If true, the flag --force-config will be set to the o2-roc-config command |
"false" |
rocConfigBypassFwCheck |
If true, the flag --bypass-fw-check will be set to the o2-roc-config command |
"false" |
rocConfigTriggerWindowSize |
Value to be passed to the --trigger-window-size flag |
1000 |
rocConfigNoGBT |
If true, the flag --no-gbt will be set to the o2-roc-config command |
"false" |
rocConfigUserLogic |
If true, the flag --user-logic will be set to the o2-roc-config command |
"false" |
rocConfigRunStats |
If true, the flag --run-stats will be set to the o2-roc-config command |
"false" |
rocConfigUserAndCommonLogic |
If true, the flag --user-and-common-logic will be set to the o2-roc-config command |
"false" |
rocConfigNoTfDetection |
If true, the flag --no-tf-detection will be set to the o2-roc-config command |
"false" |
rocConfigTfLength |
Value to be passed to the --tf-length flag |
256 |
rocConfigSystemId |
Value to be passed to the --system-id flag |
0x0 |
rocConfigFEEId |
Value to be passed to the --fee-id flag |
0x0 |
Below there are some examples on how to execute the o2-roc-config
workflow:
# Run o2-roc-config for all the cards and all endpoints using the configuration stored in consul under o2/components/readoutcard/<host>/cru/
coconut e c -a -w o2-roc-config -e '{"hosts":["<host>"]}'
# Run o2-roc-config for all the cards and specific endpoint (0) using the configuration stored in consul under o2/components/readoutcard/<host>/cru/
coconut e c -a -w o2-roc-config -e '{"hosts":["<host>"],"card_endpoints":"0"}'
# Run o2-roc-config for a specific card (1275) and all endpoints using the configuration stored in consul under o2/components/readoutcard/<host>/cru/1275/
coconut e c -a -w o2-roc-config -e '{"hosts":["<host>"],"cards":["1275"]}'
# Run o2-roc-config using the flag arguements with default values
coconut e c -a -w o2-roc-config -e '{"hosts":["<host>"],"roc_config_uri_enabled":"false"}'
# Run o2-roc-config using the flag arguements and `--force-config
coconut e c -a -w o2-roc-config -e '{"hosts":["<host>"],"roc_config_uri_enabled":"false", "rocConfigForceConfig":"true"}'
Variable | Description | Default |
---|---|---|
dcs_enabled |
If true and if the DCS plugin is enabled and configured, AliECS interfaces with the DCS service | false |
dcs_detectors |
The list of detectors for DCS SOR/EOR, passed as JSON list (see protofile for allowed values) | ["NULL_DETECTOR"] |
dcs_sor_parameters |
A key-value map of string parameters for DCS SOR, passed as JSON object | {} (empty JSON document) |
dcs_eor_parameters |
A key-value map of string parameters for DCS EOR, passed as JSON object | {} (empty JSON document) |
Example: coconut env create -w readout-dataflow -e '{"hosts":"[\"alio2-cr1-flp999\"]","dcs_enabled":"true","dcs_detectors":"[\"TPC\",\"ITS\"]"}'
Variable | Description | Default |
---|---|---|
ddsched_enabled |
If true and if the DD scheduler plugin is enabled and configured, AliECS interfaces with the DD scheduler | false |
Please note that dd_enabled
, which enables the FLP side of the Data Distribution chain, must also be true
for the DD scheduler to work, and incoming connections to StfSender
instances must be possible. The runtime variables dd_discovery_ib_hostname
, dd_discovery_stfb_id
and dd_discovery_stfs_id
are automatically generated by the readout-dataflow
workflow template and used by the DD scheduler plugin. In most cases no attempt should be made to override them.
Variable | Description | Default |
---|---|---|
odc_enabled |
If true and if the ODC plugin is enabled and configured, AliECS interfaces with the EPN cluster control | false |
odc_plugin |
The resource management system plugin to be used by ODC for deployment | empty string |
odc_resources |
A resource declaration used by ODC for deployment | empty string |
odc_topology_path |
Path to a DDS topology file, local to the ODC target machine | /etc/o2.d/odc/ex-dds-topology-infinite.xml |
Any variable prefixed with odc_
will be pushed to all ODC-controlled hosts as a task configuration parameter (ultimately a FairMQ options key-value). For example, setting odc_my_value: someValue
on AliECS pushes the key-value pair my_value: someValue
to ODC, which in turn propagates it to all controlled tasks.
In order for any of the integration variables to work, the AliECS core must have the relevant plugins enabled. The following plugin configuration parameters can be passed to the AliECS core at startup time as command line flags, or as YAML/JSON key-values in Consul /o2/components/aliecs/ANY/any/settings
. Prefix with //
if providing an IP address instead of a hostname.
--integrationPlugins strings List of integration plugins to load (default: empty)
--dcsServiceEndpoint host:port Endpoint of the DCS gRPC service (host:port) (default "//127.0.0.1:50051")
--dcsServiceUseSystemProxy When true the https_proxy, http_proxy and no_proxy environment variables are obeyed
--ddSchedulerEndpoint host:port Endpoint of the DD scheduler gRPC service (host:port) (default "//127.0.0.1:50052")
--ddSchedulerUseSystemProxy When true the https_proxy, http_proxy and no_proxy environment variables are obeyed
--odcEndpoint host:port Endpoint of the ODC gRPC service (host:port) (default "//127.0.0.1:50053")
--odcUseSystemProxy When true the https_proxy, http_proxy and no_proxy environment variables are obeyed
Example:
integrationPlugins:
- dcs
- ddsched
- odc
dcsServiceEndpoint: some-host:50051
ddSchedulerEndpoint: some-other-host-ib:50000
odcEndpoint: yet-another-host-ib:22334
It is possible to use FLP & QC DPL workflows that are generated just-in-time during environment creation.
To use JIT the following variables need to be set for FLP & QC node workflows accordingly:
flp_workflows_jit_enabled: true
qc_remote_jit_enabled: true
These might be set as default values or exist as radio buttons in the GUI. Otherwise they need to be overriden manually from the "Advanced Configuration" panel.
The JIT generation system relies on the existence & health of the following parts:
- DPL command provided
- The full DPL command can be found in
ControlWorfklows/jit/[workflow name]
- Alternatively, a custom DPL command can be provided through the "Advanced Configuration" panel, which will take precedence over the workflow normally selected through the interface. See the next subsection for details.
- The full DPL command can be found in
- Consul payloads (e.g. QC config files) contained in the DPL command
- These are parsed from the provided DPL command string and Consul is queried regarding their version to ensure freshness.
- JIT-specific env vars, which are common to all JIT-generated workflows
- These are expected on the deployment's Consul instance under
o2/components/aliecs/[defaults|vars]/jit_env_vars
- These are expected on the deployment's Consul instance under
- The O2 & QualityControl versions
- The O2 & QualityControl RPM versions are queried by AliECS to ensure workflow freshness.
A templated DPL command may be passed through the dpl_command
variable, prefixed with the detector code (
e.g. its_dpl_command
for the ITS). The variable may be set through the AliECS GUI environment creation page under
the "Advanced Configuration" panel.
For example, the equivalent of the minimal DPL workflow can be achieved by setting the following KV pair (assuming ITS as a target):
- through the "Add single pair:" key and value fields.
key: its_dpl_command
value:
o2-dpl-raw-proxy -b --session default --dataspec 'x:{{ detector }}/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' --readout-proxy '--channel-config "name=readout-proxy,type=pull,method=connect,address=ipc:///tmp/stf-builder-dpl-pipe-0,transport=shmem,rateLogging=10"' | o2-dpl-output-proxy -b --session default --dataspec 'x:{{ detector }}/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' --dpl-output-proxy '--channel-config "name=downstream,type=push,method=bind,address=ipc:///tmp/stf-pipe-0,rateLogging=10,transport=shmem"'
- through the "Add a JSON with multiple pairs:" field (make sure to escape the inner
"
):
{
"its_dpl_command": "o2-dpl-raw-proxy -b --session default --dataspec 'x:{{ detector }}/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' --readout-proxy '--channel-config \"name=readout-proxy,type=pull,method=connect,address=ipc:///tmp/stf-builder-dpl-pipe-0,transport=shmem,rateLogging=10\"' | o2-dpl-output-proxy -b --session default --dataspec 'x:{{ detector }}/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' --dpl-output-proxy '--channel-config \"name=downstream,type=push,method=bind,address=ipc:///tmp/stf-pipe-0,rateLogging=10,transport=shmem\"'"
}
Everything described in this subsection for the dpl_command
for FLP workflows is also applicable
to the qc_dpl_command
for QC node workflows.
In order to ensure a successful collaboration with AliECS GUI, every pull request will run a git diff
command to ensure no changes are done to the naming of certain variables (full list here) that are used on the AliECS GUI side.
If the checks identify any such breaking changes, the pipeline will fail with information on what labels were identified. In such case the developer of AliECS GUI should be notified before the pull request is merged.
These checks are ran automatically against macOS-latest
.
Source code can be found here.
This piece of documentation explains how to generate a DPL workflow template which should run on an FLP.
All FLPs use the readout-dataflow
workflow to run the common pieces of software - Readout, STFBuilder, STFSender, ROC and others.
It can also include DPL sub-workflows which may perform some data processing and/or quality control (max. one per FLP).
Additionally, a remote QC workflow can be added (e.g. with Mergers).
All available DPL workflows can be (re)generated with scripts stored in the scripts
directory.
Use generate-all-dpl-workflows.sh
to regenerate all the DPL workflows, or any particular script to regenerate only the selected one.
All scripts should be executed from within the scripts
directory.
When adding a new workflow template, please include the script to generate it with the PR and add it to generate-all-dpl-workflows.sh
, so it is ran each time we update the software stack.
Before exporting the workflow, one should prepare a DPL command which is able to get data from STFBuilder, process it and send the results to the STFSender. The first and the last requirements are handled by the input and output DPL proxies. The simplest possible DPL workflow command for an FLP receives data from STFBuilder and passes it to STFSender without any processing:
o2-dpl-raw-proxy -b --session default \
--dataspec 'x:TST/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--readout-proxy '--channel-config "name=readout-proxy,type=pull,method=connect,address=ipc:///tmp/stf-builder-dpl-pipe-0,transport=shmem,rateLogging=10"' \
| o2-dpl-output-proxy --environment "DPL_OUTPUT_PROXY_ORDERED=1" -b --session default \
--dataspec 'x:TST/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--dpl-output-proxy '--channel-config "name=downstream,type=push,method=bind,address=ipc:///tmp/stf-pipe-0,rateLogging=10,transport=shmem"'
The --dataspec
arguments define what kind of data is expected both in the input and output proxies.
The presence of the DPL_OUTPUT_PROXY_ORDERDED
env var is required by the DPL to correctly order output proxies.
The readout-dataflow
workflow requires that the input proxy contains a channel named readout-proxy
, while the output proxy should provide the channel downstream
.
This way the AliECS can find the other ends of the channels used by the STFBuilder and STFSender to send and receive data.
There can be only one set of proxies interacting with Data Distribution at the same time.
The pipe |
character connects the two or more workflows by letting the preceding one transfer its structure to the next one and letting the last execute all the processes (the pipe does not transfer data, just the workflow configuration!).
Let's consider a more realistic example - the PHOS compressor. In such case, the DPL command looks as follows:
o2-dpl-raw-proxy -b --session default \
--dataspec 'x:PHS/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--readout-proxy '--channel-config "name=readout-proxy,type=pull,method=connect,address=ipc:///tmp/stf-builder-dpl-pipe-0,transport=shmem,rateLogging=1"' \
| o2-phos-reco-workflow -b --input-type raw --output-type cells --session default --disable-root-output \
| o2-dpl-output-proxy --environment "DPL_OUTPUT_PROXY_ORDERED=1" -b --session default \
--dataspec 'A:PHS/CELLS/0;dd:FLP/DISTSUBTIMEFRAME/0' \
--dpl-output-proxy '--channel-config "name=downstream,type=push,method=bind,address=ipc:///tmp/stf-pipe-0,rateLogging=1,transport=shmem"'
Notice that the dataspecs
of the I/O proxies were changed accordingly to the expected data origin and description.
If you do not know what these should be, you should ask the developer of the workflow in question or investigate yourself the workflow structure by executing it with the --dump
argument.
While preparing the DPL command, please avoid adding Data Processors which dump data to local files - these might get big during the data-taking and use all available resources. Also, this particular workflow structure does not depend on any configuration files, so it always stays the same, unlike QC workflows. We will consider such cases later in the documentation.
Please make sure that you have built and compiled the same software stack on your setup as the one which will run on the target machines. If different versions are used, there is a risk that the workflows will not be deployed correctly in case that they were modified. Exporting workflow templates is possible since [email protected].
To export a workflow template, follow these steps:
- Load the O2 (and QC if needed) environment:
alienv enter O2/latest
# or
alienv enter QualityControl/latest # will load also O2
- Go to your local ControlWorkflows repository (clone if needed), preferably create a new branch from master.
- Run the DPL command with
--o2-control <workflow-name>
argument at the end. For example:
o2-dpl-raw-proxy -b --session default \
--dataspec 'x:PHS/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--readout-proxy '--channel-config "name=readout-proxy,type=pull,method=connect,address=ipc:///tmp/stf-builder-dpl-pipe-0,transport=shmem,rateLogging=1"' \
| o2-phos-reco-workflow -b --input-type raw --output-type cells --session default --disable-root-output \
| o2-dpl-output-proxy --environment "DPL_OUTPUT_PROXY_ORDERED=1" -b --session default \
--dataspec 'A:PHS/CELLS/0;dd:FLP/DISTSUBTIMEFRAME/0' \
--dpl-output-proxy '--channel-config "name=downstream,type=push,method=bind,address=ipc:///tmp/stf-pipe-0,rateLogging=1,transport=shmem"' \
--o2-control phos-compressor
If there are no problems with the workflow, you will see a similar output:
[INFO] Dumping the workflow configuration for AliECS.
[INFO] Creating directories 'workflows' and 'tasks'.
[INFO] ... created.
[INFO] Creating a workflow dump 'phos-compressor'.
[INFO] This topology will connect to the channel 'readout-proxy', which is most likely bound outside. Please provide the path to this channel under the name 'path_to_readout_proxy' in the mother workflow.
[INFO] Creating a task dump for 'internal-dpl-clock'.
[INFO] ...created.
[INFO] Creating a task dump for 'readout-proxy'.
[INFO] ...created.
[INFO] Creating a task dump for 'PHOSRawToCellConverterSpec'.
[INFO] ...created.
[INFO] Creating a task dump for 'dpl-output-proxy'.
[INFO] This topology will bind a dangling channel 'downstream'. Please make sure that another device connects to this channel elsewhere.
[INFO] ...created.
[INFO] Creating a task dump for 'internal-dpl-injected-dummy-sink'.
[INFO] ...created.
The corresponding workflow template (list of processes to run) will be created in the workflows
directory and the tasks templates (processes configurations) will be put under the tasks
directory.
Please also add the new workflow to the dpl_workflow
list in workflows/readout-dataflow.yaml
.
- Commit the new files and push to a remote branch.
One can run it by pointing the AliECS to respective branch, choosing the
readout-dataflow
workflow in the AliECS GUI and adding the parameterdpl_workflow : <workflow_name>
in the advanced configuration. If running a setup with multiple detectors, add the 3-letter detector prefix to the key (e.g.tof_dpl_workflow
). After confirming that it works, make a PR to the main ControlWorkflows' master branch.
Some DPL workflows require configuration files to run correctly. Also the process names, channel names and their arrangement might depend on such configuration files. Quality Control workflows fall into this category. In such case, exporting templates require a bit more babysitting.
Let's consider a workflow consisting of I/O proxies, MFT decoder and MFT Digit QC Task:
o2-dpl-raw-proxy -b --session default \
--dataspec 'x:MFT/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--readout-proxy '--channel-config "name=readout-proxy,type=pull,method=connect,address=ipc:///tmp/stf-builder-dpl-pipe-0,transport=shmem,rateLogging=10"' \
| o2-itsmft-stf-decoder-workflow -b --runmft --digits --no-clusters --no-cluster-patterns \
| o2-dpl-output-proxy --environment "DPL_OUTPUT_PROXY_ORDERED=1" -b --session default \
--dataspec 'x:MFT/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--dpl-output-proxy '--channel-config "name=downstream,type=push,method=bind,address=ipc:///tmp/stf-pipe-0,rateLogging=10,transport=shmem"' \
| o2-qc -b --config json://'${QUALITYCONTROL_ROOT}'/etc/mft-digit-qc-task-FLP-0-TaskLevel-0.json
If we were to generate and use such workflow template, the QC would look for the QC configuration file under the same path as during the export, which might not be accesible on the target machine. Thus, we will substitute it with the expected path of such file in Consul (the configuration store). To do so, one may follow the script below (see the associated comments):
#!/usr/bin/env bash
set -x; # debug mode
set -e; # exit on error
set -u; # exit on undefined variable
# Variables
WF_NAME=mft-digits-qc
QC_GEN_CONFIG_PATH='json://'${QUALITYCONTROL_ROOT}'/etc/mft-digit-qc-task-FLP-0-TaskLevel-0.json'
QC_FINAL_CONFIG_PATH='consul-json://{{ consul_endpoint }}/o2/components/qc/ANY/any/'${WF_NAME}'-{{ it }}'
QC_CONFIG_PARAM='qc_config_uri'
# Generate the AliECS workflow and task templates
o2-dpl-raw-proxy -b --session default \
--dataspec 'x:MFT/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--readout-proxy '--channel-config "name=readout-proxy,type=pull,method=connect,address=ipc:///tmp/stf-builder-dpl-pipe-0,transport=shmem,rateLogging=10"' \
| o2-itsmft-stf-decoder-workflow -b --runmft --digits --no-clusters --no-cluster-patterns \
| o2-dpl-output-proxy --environment "DPL_OUTPUT_PROXY_ORDERED=1" -b --session default \
--dataspec 'x:MFT/RAWDATA;dd:FLP/DISTSUBTIMEFRAME/0' \
--dpl-output-proxy '--channel-config "name=downstream,type=push,method=bind,address=ipc:///tmp/stf-pipe-0,rateLogging=10,transport=shmem"' \
| o2-qc --config ${QC_GEN_CONFIG_PATH} -b \
--o2-control $WF_NAME
# Add the final QC config file path as a variable in the workflow template
ESCAPED_QC_FINAL_CONFIG_PATH=$(printf '%s\n' "$QC_FINAL_CONFIG_PATH" | sed -e 's/[\/&]/\\&/g')
# Will work only with GNU sed (Mac uses BSD sed)
sed -i /defaults:/\ a\\\ \\\ "${QC_CONFIG_PARAM}":\ \""${ESCAPED_QC_FINAL_CONFIG_PATH}"\" workflows/${WF_NAME}.yaml
# Find all usages of the QC config path which was used to generate the workflow and replace them with the template variable
ESCAPED_QC_GEN_CONFIG_PATH=$(printf '%s\n' "$QC_GEN_CONFIG_PATH" | sed -e 's/[]\/$*.^[]/\\&/g');
# Will work only with GNU sed (Mac uses BSD sed)
sed -i "s/""${ESCAPED_QC_GEN_CONFIG_PATH}""/{{ ""${QC_CONFIG_PARAM}"" }}/g" workflows/${WF_NAME}.yaml tasks/${WF_NAME}-*
After these steps, the workflow should be ready to be committed and tested.
However, one should also make sure that the required config file is available in Consul under the correct path.
To do so, one can add it using the Consul GUI (Key/Value panel).
Otherwise, to install it with each FLP suite, one should add a file template in the System configuration repository under the path ansible/roles/quality-control/templates
similarly to the other QC files (if it is actually QC).
Then one should add the file to the corresponding sub-task in ansible/roles/quality-control/tasks/main.yml
.
If the expected production setup includes QC running in parallel on many FLPs, one should generate two workflow templates - one for the FLP part, another one which should run on a QC server.
First, one should prepare DPL commands and QC config file according to the multinode QC setup documentation.
Then, the two workflows should be generated with -local
and -remote
name suffixes respectively, as it is done e.g. in scripts/qcmn-daq.sh
.
Following this example, the full setup can be run by adding the following parameters in the advanced configuration panel:
"dpl_workflow" : "qcmn-daq-local"
"qc_remote_workflow" : "qcmn-daq-remote"
Do not forget to add both workflows to the coresponding lists in workflows/readout-dataflow.yaml
.
In this case, the local part of the QC workflow is run on EPNs (controlled by ODC), while the remote part is still executed on QC servers (controlled directly by AliECS). Both parts should use the same version of QC and the underlying software stack.
First, please make sure that the QC config file contains valid "remoteMachine"
and "remotePort"
parameters, as they are not dynamically assigned for connections between the two control systems.
The remote machine name might need the .cern.ch
suffix.
Please use the port number between 47700 and 47799.
It is highly advised to check the connection with a simple TCP client/server application beforehand (e.g. nc
).
Also, do not forget to add "localControl" : "odc"
in the QC task configuration, which will make AliECS templates avoid dynamic resource assignement.
The EPN part will require exporting a DDS topology file, which then should be run as any other DDS topologies. Please contact the EPN team for details.
The QC server part requires an AliECS template, which should be generated similarly to the one for the FLP case.
No local counterpart is needed.
Please refer to scripts/emc-qcmn-epn.sh
as an example.
In that case, the AliECS-controlled part of the workflow can be run with the follwing parameter in the advanced configuration panel.
"qc_remote_workflow" : "emc-qcmn-epn-remote"
Currently recommended way to approach this is to combine the FLP and EPN config files and use different "localMachines"
for them.
Then, one can export the AliECS templates and DDS topologies for the local QC workflows by invoking them with different --host
parameters.
The remote part should be just one.
With the future releases we plan to allow for Just-In-Time workflow translation. It means that one will not have to export the templates manually anymore and they will only need to provide a file with the standalone DPL command.
If you have read everything above, you can now follow these simplified instructions.
- Access an FLP with the proper FLP Suite
- Prepare the workflow
- Clone ControlWorkflows from your fork:
git clone https://github.com/<yourGHusername>/ControlWorkflows.git
- Make sure that you are in line with the correct branch:
git remote add upstream https://github.com/AliceO2Group/ControlWorkflows.git git fetch upstream git checkout flp-suite-v0.xx.0 git checkout -b my-branch
- Update a script in
ControlWorkflows/scripts
or add a new one - Run the script to re-generate the workflow(s):
cd ControlWorkflows/scripts ; ./my-script.sh
- If you need to use config files, refer to this section
- Add the new workflow names to the lists in
workflows/readout-dataflow.yaml
- Commit and push the changes
- Clone ControlWorkflows from your fork:
- Test it
- Add the fork to the control:
coconut repo add github.com/<yourGHusername>/ControlWorkflows.git
- Refresh the repository (refresh button in the AliECS GUI)
- In the ECS, create a new environment.
- Set the fork and the branch to match yours.
- Add the variable
dpl_workflow
and set it to the name of the workflow - Add the variable
log_task_output
and set it toall
to make sure you can see the output of the tasks in the Infologger. - Do not enable QC but enable DD.
- Run and check that it starts and stops without failures.
- Add the fork to the control:
- Add the new scripts, if any, to
scripts/generate-all-dpl-workflows.sh