Newer
Older
2. [HEPscore23Beta Benchmark](#hepscore23beta-benchmark)
3. [Downloading and Installing HEPscore](#downloading-and-installing-hepscore)
4. [Dependencies](#dependencies)
5. [Configuring HEPscore](#configuring-hepscore)
1. [Parameters](#parameters)
6. [Feedback and Support](#feedback-and-support)
The HEPscore application orchestrates the execution of user-configurable
benchmark suites based on individual benchmark containers.
It runs the specified benchmark containers in sequence, collects their
results, and computes a final overall score.
For instructions on how to run the HEPScore23 benchmark, please refer to the [dedicated HEPiX Benchmark page](https://w3.hepix.org/benchmarking/how_to_run_HS23.html). The HEPScore23 scores for the benchmarked servers are reported in this [table](https://w3.hepix.org/benchmarking/scores_HS23.html).
HEPscore is specifically designed for use with containers from the
[HEP Workloads project](https://gitlab.cern.ch/hep-benchmarks/hep-workloads).
However, any benchmark containers stored in a Docker/Singularity
registry, or filesystem directory, which conform to the HEP Workloads'
output JSON schema, are potentially usable.
Both Singularity (including Apptainer and SingularityCE) and Docker are
supported for container execution. While Podman is not yet officially
supported, tests have shown that it is functional with HEPscore and workloads
from the HEP Workloads project as long as a "docker" symlink exists to the
"podman" binary, and the open file descriptor ulimit can be set to 100k or
higher (this may require administrator privileges).
By default, if no configuration is passed to HEPscore, the "HEPscore23Beta"
benchmark is run.
The HEPscore value for a given configuration is computed as the geometric
mean of the performance scores measured on a given server when running each
workload included in that configuration. (Note: the workload performance score
is in general expressed as the event throughput of the workload process.)
Each workload score is normalised to the score of the reference server, in
order to make it a dimensionless factor. The reference server model is
reported in the configuration settings (check for key “reference_machine").
The reference server for HEPscore23 is:
“Intel CPU Gold 6326 CPU @ 2.90GHz - 64 cores SMT ON"
The normalised scores are then averaged, using the geometric mean, and are
rescaled to the scale value reported in the configuration settings
(check for key “scaling”). The obtained value is the HEPscore score.
Therefore, by definition, the scaling value is the HEPscore score of the
reference server. The scaling value for HEPscore23 is defined as the HS06
(32 bits) score value for the very same reference server (HS06 = 1018).
## HEPscore23Beta Benchmark
HEPscore23Beta is a benchmark based on containerized HEP workloads that
the HEPiX Benchmarking Working Group is targeting to eventually replace
HEPSPEC06 as the standard HEPiX/WLCG benchmark. It is currently in a
proof of concept development state, and consists of the following workloads
from the
[HEP Workloads project](
https://gitlab.cern.ch/hep-benchmarks/hep-workloads):
atlas-gen_sherpa-ma-bmk
atlas-reco_mt-ma-bmk
cms-gen-sim-run3-ma-bmk
cms-reco-run3-ma-bmk
lhcb-gen-sim-2021-bmk
belle2-gen-sim-reco-2021-bmk
alice-digi-reco-core-run3-bmk
You can view the YAML HEPscore configuration for HEPscore23Beta by
executing ```hep-score -p```.
The benchmark will take 5+ hours to execute on modern hardware.
**NOTE**: ~20 GB of free disk space in your Singularity or Docker
cache area, and 320 MB/core of free space (e.g. 20 GB on 64 core host)
in the specified OUTDIR output directory is necessary to run the
HEPscore23Beta benchmark. If passed the ```-c``` (clean images) and
```-C``` (clean files) command line options, hep-score will clean
the benchmark container images and output after execution, which will
greatly reduce the amount of space needed to run.
It is also possible to run the benchmark containers out of the
"unpacked.cern.ch" CVMFS repo instead of the CERN gitlab Docker registry,
by passing ```hep-score``` the
[hepscore-cvmfs.yaml](hepscore/etc/hepscore23-cvmfs.yaml)
file shipped in the application's etc/ directory. When running the
benchmark using the unpacked images in CVMFS, the Singularity cache area
is not utilized.
## Downloading and Installing HEPscore
HEPscore must be installed using pip (<https://pypi.org/project/pip/>).
To install as a regular user (suggested):
```$ pip install --user git+https://gitlab.cern.ch/hep-benchmarks/hep-score.git```
The ```hep-score``` script will then be accessible under ```~/.local/bin```.
If you have administrator rights on your system and would like to install
hep-score and all dependencies in system Python/bin paths:
```# pip install git+https://gitlab.cern.ch/hep-benchmarks/hep-score.git```
Alternatively, you can clone the hep-score git repository, and run the pip
installation out of the root directory of the repo:
$ git clone https://gitlab.cern.ch/hep-benchmarks/hep-score.git
$ cd hep-score
$ pip install --user .
**NOTE**: on RHEL/CentOS/Scientific Linux 7 hosts, where python 3 is not
the default python installation, it may be necessary to use ```pip3``` to
install instead of ```pip```.
Release tarfiles containing the Python wheel for the hepscore package, as
well as all dependency wheels, are available/published in the
[HEPscore release documentation](https://gitlab.cern.ch/hep-benchmarks/hep-score/-/releases).
An archive of all released wheel tarfiles is also available here:
<https://hep-benchmarks.web.cern.ch/hep-score/releases/>.
These wheels can be used to install HEPscore via pip on hosts without
network connectivity. To install, after downloading and untarring a
release tarfile, execute ```pip install --user hepscore_wheels/*.whl```.
Christopher Henry Hollowell
committed
## Dependencies
HEPscore requires a **Python 3.6+** installation. The pip installation will pull
in all python module dependencies. HEPscore should be used with **Singularity
3.5.3 and newer**, or **Docker 1.13 and newer**. There are some known issues
when using HEPscore with earlier Singularity and Docker releases.
**NOTE**: if you are running hep-score with setuid-enabled Singularity/Apptainer
(instead of user namespace-based execution) it may be necessary to increase the
"sessiondir max size" setting to "64" MB in your system singularity/apptainer
configuraton file on many-core hosts.
usage: hep-score [-h] [-m [{singularity,docker}]] [-S] [-c] [-C]
[-f [CONFFILE]] [-l] [-n [NAMEDCONF]] [-r] [-o [OUTFILE]]
[-y] [-p] [-V] [-v]
positional arguments:
OUTDIR Base output directory.
optional arguments:
-h, --help show this help message and exit
-m [{singularity,docker}], --container_exec [{singularity,docker}]
specify container platform for benchmark execution
(singularity [default], or docker).
-S, --userns enable user namespace for Singularity, if supported.
-c, --clean clean residual container images from system after run.
-C, --clean_files clean residual files & directories after execution.
Tar results.
-f [CONFFILE], --conffile [CONFFILE]
custom config yaml to use instead of default.
-l, --list list built-in benchmark configurations and exit.
-n [NAMEDCONF], --namedconf [NAMEDCONF]
use specified named built-in benchmark configuration.
-r, --replay replay output using existing results directory OUTDIR.
-o [OUTFILE], --outfile [OUTFILE]
specify summary output file path/name.
-y, --yaml create YAML summary output instead of JSON.
-V, --version show program's version number and exit
-v, --verbose enables verbose mode. Display debug messages.
Run benchmarks via Docker, and display verbose information:
$ hep-score -v -m docker ./testdir
Run using Singularity (default) with a custom benchmark configuration:
$ hep-score -f /tmp/my-custom-bmk.yml /tmp
List built-in benchmark configurations:
$ hep-score -l
Run with a specified built-in benchmark configuration:
$ hep-score -n hepscore_testkv /tmp
Singularity will be used as the container engine for the run, unless Docker
is specified on the hep-score commmandline (```-m docker```), or in the
benchmark configuration.
hep-score creates a HEPscore_DATE_TIME named directory under OUTDIR which
is used as the working directory for the sub-benchmark containers. A detailed
log of the run of the application is also written to this directory:
BENCHMARK_NAME.log, where BENCHMARK_NAME is taken from the "name" parameter in
the YAML configuration ("HEPscore23Beta.log" by default).
The final computed score will be printed to stdout ("Final score: XYZ"), and
also stored in a summary output JSON (or YAML, if ```-y``` is specified) file
under OUTDIR (unless an alternative location is specified with ```-o```). This
file also contains all of the summary JSON output data from each sub-benchmark.
An example hepscore YAML configuration is below:
hepscore_benchmark:
benchmarks:
cms-reco-bmk:
results_file: cms-reco_summary.json
weight: 1.0
version: v2.1
results_file: lhcb-gen-sim_summary.json
ref_scores:
gen-sim: 90.29
weight: 1.0
version: v2.1
threads: 1
events: 5
settings:
name: TestBenchmark
reference_machine: "CPU Intel(R) Xeon(R) CPU E5-2630 v3 @ 2.40GHz"
registry: docker://gitlab-registry.cern.ch/hep-benchmarks/hep-workloads
method: geometric_mean
repetitions: 3
scaling: 355
container_exec: singularity
```
All configuration parameters must be under the "hepscore_benchmark" key.
Specifies a list of benchmark containers to run, with associated settings:
##### BENCHMARK_NAME (required)
DICTIONARY; where BENCHMARK_NAME is variable
BENCHMARK_NAME denotes the name of the benchmark container to run. See
per-benchmark settings below
###### ref_scores (required)
List of sub-scores to collect from the benchmark container output
JSON, with reference scores from the specified "reference_machine". Each
sub-score is divided by its reference score, and the geometric mean is
taken from the results to compute a final score for the benchmark
The version of the benchmark container to execute
###### results_file
STRING ; defaults to BENCHMARK_NAME.json, where BENCHMARK_NAME
is the name of the benchmark container.
The name of the benchmark container results JSON file.
DICTIONARY; default set by container
Set options to pass to the benchmark container. Typically, ```events``` to
set the number of events, ```threads``` for the number threads, ```copies```
for the number of copies, and ```debug```to enable debugging output
###### gpu
BOOL; default = false
Enable GPU support in Singularity/Docker call.
Sets the weight for the benchmark container when calculating the overall
score
STRING; defaults to the primary registry specified in "settings"
Allows for overriding the registry to use for this container. See
"registry", under "settings" below, for more information
Defines overall benchmark application settings/information below
The name of the overall benchmark/configuration
##### reference_machine (required)
Specifies host/configuration where the benchmark container scores have
been normalized to 1.0. All reported scores are then relative to the
The registry to run containers from. Multiple URIs are permitted:
```docker://``` to specify a Docker registry, ```dir://``` (Singularity
only) to specify a local directory containing unpacked images or image
files, ```shub://``` (Singularity only) to specify a Singularity
registry, ```oras://``` (Singularity only) to specify an OCI registry,
or ```https://``` (Singularity only) to specify an HTTPS image
repository.
The method for combining the sub-benchmark scores into a single score.
Currently only "geometric_mean" is supported. This is actually a
weighted geometric mean, with weights taken from 'weight' configuration
parameter for each benchmark.
##### repetitions (required)
INTEGER
The number of times each benchmark container should be run. If the
number of runs is greater than one, the median of that container's
The multi-benchmark score calculated via the "method" parameter is
multiplied by this value to compute the final score
STRING; defaullt = "singularity"
Allows one to specify the default container execution platform:
"singularity" and "docker" are supported. This can be overridden on the
INTEGER; default = 0
Specifies how many times to retry running a container if it fails.
BOOL; default = false
Defines whether hep-score should continue attempting to run other
sub-benchmarks, if a sub-benchmark fails and does not produce a
resulting score. If true, and this condition occurs, an overall/final
score will *not* be reported by the application, but runs of other
sub-benchmarks will continue. This parameter is primarily useful for
testing and debugging purposes
##### addarch
BOOL; default = false
Append architecture to container version tag if Singularity/Apptainer are
being used.
## Feedback and Support
Feedback and support questions are welcome primarily through [GGUS tickets](https://w3.hepix.org/benchmarking/how_to_run_HS23.html#how-to-open-a-ggus-ticket) or in the HEP Benchmarks Project
[Discourse Forum](https://wlcg-discourse.web.cern.ch/c/hep-benchmarks).
| **qa-v1.0** | [![pipeline status](https://gitlab.cern.ch/hep-benchmarks/hep-score/badges/qa-v1.0/pipeline.svg)](https://gitlab.cern.ch/hep-benchmarks/hep-score/commits/qa-v1.0) | ![code quality](https://gitlab.cern.ch/hep-benchmarks/hep-score/-/jobs/artifacts/qa-v1.0/raw/public/badges/pep8.svg?job=pep8) |
| **master** | [![pipeline status](https://gitlab.cern.ch/hep-benchmarks/hep-score/badges/master/pipeline.svg)](https://gitlab.cern.ch/hep-benchmarks/hep-score/commits/master) | ![code quality](https://gitlab.cern.ch/hep-benchmarks/hep-score/-/jobs/artifacts/master/raw/public/badges/pep8.svg?job=pep8) |