Skip to content

Athena Python Configuration

Introduction

This guide describes the Athena Python configuration system that is in use since Run-3. It is also known as the ComponentAccumulator based configuration and the core code lives in the package Control/AthenaConfiguration.

The configuration system of Athena allows us to assemble and customize components (algorithms, services and tools) for a workflow. It is done by creation and manipulation of python objects. The result of the configuration process is a set of settings that are used by Athena to instantiate and appropriately configure C++ components. Even in a small workflow the number of settings can easily be a few hundreds and in a complete reconstruction or HLT setup the number of settings reaches tens of thousands.

For help, contact: atlas-sw-jobconfig@cern.ch

Overview

In broad terms the configuration system works as follows.

  1. Each C++ component defines a set of configurable parameters via Gaudi::Property members or (deprecated) declareProperty calls. *️⃣
    • There is rich, yet restricted set of types of configurable parameters that are supported.
    • Typically reasonable default values are defined as well.
  2. During compilation a Python "Configurable database" is built containing Python classes for each C++ class with attributes corresponding to the C++ properties.
  3. Python scripts instantiate classes and set the relevant properties. *️⃣
  4. The resulting set of python objects produce a serialized/textual representation of the configuration.
  5. The configuration is read by the C++ program and populates the global job options catalog.
  6. During initialize the C++ components fetch their respective properties from the catalog.

The steps marked with *️⃣ are places where developer intervention is needed. The other steps are fully automatized.

Conventions

For consistency, and to make a clear distinction between legacy configuration files and the Component Accumulator (CA) system, the following conventions are used:

Naming

  1. Files containing CA configuration code must end with Config.py, e.g. HelloWorldConfig.py.
  2. The name of the function generating the configuration fragment (i.e creating a ComponentAccumulator instance) should normally start with the name of the main component they configure and must end with Cfg.
  3. If the file is supposed be executed by athena.py add a shebang to the first line (#!/usr/bin/env athena). This allows athena to recognize the file as a CA configuration as opposed to legacy job options.

Arguments

  1. Configuration functions should always accept an instance of configuration flags as first argument (even if they don't actually rely on flags) e.g.
    def myComponentCfg(flags, **kwargs):
    
  2. Optionally, you may choose to pass a default name as a second argument. This can be useful if you need to have multiple versions of a component. However, it is not normally recommended for private tools (see later), as for private tools the name is not really used.
    def myComponentCfg(flags, name = "MyComponentVersionA", **kwargs):
    
  3. Other named arguments (positional arguments are not recommended) can also be required, with defaults if necessary. Examples of this would be were clients always need to specify some extra information for the configuration to be possible (inputTracks, in the example below).
    def Trig_TRT_TrackExtensionAlgCfg(flags, inputTracks,
                                      name = 'TrigTrackExtensionAlg', **kwargs):
    
  4. Where the configuration function is configuring one component (i.e. it is named MyComponentCfg) we would normally pass kwargs as the last argument to configure the properties of that component, e.g. TrackSlimmerCfg:
    def TrackSlimmerCfg(flags, name="TrackSlimmer", **kwargs):
    acc = ComponentAccumulator()
    if "TrackSlimmingTool" not in kwargs:
        from TrkConfig.TrkTrackSlimmingToolConfig import TrackSlimmingToolCfg
        TrackSlimmingTool = acc.popToolsAndMerge(TrackSlimmingToolCfg(flags))
        acc.addPublicTool(TrackSlimmingTool)
        kwargs.setdefault("TrackSlimmingTool", TrackSlimmingTool)
    acc.addEventAlgo(CompFactory.Trk.TrackSlimmer(name, **kwargs))
    return acc
    
  5. For specialised configs of a single component where no additional customisation of the configurations is expected beyond what is laid out in the configuration method (e.g. for concrete trigger, muon, EGamma specific tools and algorithms), the kwargs argument can be potentially omitted from the inputs of the configuration function. A baseline configuration function of this component should be provided in that case, including kwargs as input, and ideally used in the specialised config. For example Trig_SCT_ClusteringToolCfg:
    from SCT_ConditionsTools.SCT_ConditionsToolsConfig import SCT_ConditionsSummaryToolCfg
    
    def Trig_SCT_ClusteringToolCfg(flags, name="Trig_SCT_ClusteringTool"):
       acc = ComponentAccumulator()
       conditionsTool = acc.popToolsAndMerge(SCT_ConditionsSummaryToolCfg(
           flags, withFlaggedCondTool=False, withTdaqTool=False))
    
       acc.setPrivateTools(acc.popToolsAndMerge(SCT_ClusteringToolCfg(
           flags, name,
           conditionsTool=conditionsTool, SCTDetElStatus="")))
    
       return acc
    
  6. For more complicated functions that configure a set of components (and therefore it is not clear which properties are being configured), kwargs should not be used.

Return values

  1. Configuration functions should always return an instance of ComponentAccumulator (even if they configure just one private AlgTool).

You can find some more examples in the later section which describes in detail how to write configuration methods.