Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rewrite filter section #246

Merged
merged 1 commit into from
Jan 17, 2024
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
96 changes: 63 additions & 33 deletions docs/basics-tasks/FilteringPartitioning.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,34 +34,65 @@

```cpp
struct MyTask : AnalysisTask {
Filter<Tracks> ptFilter = track::pt > 1;
Filter ptFilter = track::pt > 1.f;

void process(Tracks const &filteredTracks) {
void process(soa::Filtered<Tracks> const &filteredTracks) {
for (auto& track : filteredTracks) {
}
}
};
```

filteredTracks will contain only the tracks in the table which pass the condition `track::pt > 1`.
filteredTracks will contain only the tracks in the table which pass the condition `track::pt > 1.f`.

Filter is automatically associated with a compatible table, i.e. table that has all the columns that are present in the filter expression.

Check failure on line 48 in docs/basics-tasks/FilteringPartitioning.md

View workflow job for this annotation

GitHub Actions / PR formatting / whitespace

Trailing spaces

Remove the trailing spaces at the end of the line.
It is currently impossible to have a filter that uses columns from more than one table.
If there are several filters declared, that are compatible with a single table, they are combined with logical "and".

### Expressions
`Filters` and `Partitions` are defined using `expressions`, as in the example above, which are C++ expressions that can contain _columns_, numerical and boolean _constants_, single-valued _configurables_, a limited set of special _functions_ and _branching_ conditions. The expressions are compiled on-the-fly to be applied and therefore require _explicit_ types to be used. For example in the expression `track::pt > 1.f` the numerical constant is marked to be `float`, while a simple `1` would be parsed as `int` and `1.` would be parsed as `double`.

Check failure on line 53 in docs/basics-tasks/FilteringPartitioning.md

View workflow job for this annotation

GitHub Actions / PR formatting / whitespace

Trailing spaces

Remove the trailing spaces at the end of the line.

`Filters` and `Partitions` require the expression result to be `boolean`.

Check failure on line 55 in docs/basics-tasks/FilteringPartitioning.md

View workflow job for this annotation

GitHub Actions / PR formatting / whitespace

Trailing spaces

Remove the trailing spaces at the end of the line.

The following funcitons and operations are permitted in expressions. Each `expr1` and `expr2` can be a single column, constant, configurable, condition, or a valid expression. Previously defined filters can also be used by name. Parentheses can be used as usual.

| Operation | Syntax | Description |
|:----------|:----------|:----------|
| Logical "and" | `expr1 && expr2` | logical multiplication of two expressions (both boolean) |
| Logical "or" | `expr1 \|\| expr2` | logical addition of two expressions (bothboolean) |
| Addition | `expr1 + expr2` | Addition of two expressions (compatible types) |
| Subtraction | `expr1 - expr2` | Subtraction of two expressions (compatible types) |
| Division | `expr1 / expr2` | Division of two expressions (compatibel types) |
| Multiplication | `expr1 * expr2` | Multiplication of two expressions (compatible types) |
| Bitwise "and" | `expr1 & expr2` | Bitwise logical multiplication (equally-sized integer types) |
| Bitwise "or" | `expr1 \| expr2` | Bitwise logical addition (equally-sized integer types) |
| Bitwise "xor" | `expr1 ^ expr2` | Bitwise exclusive logical multiplication (equally-sized integer types) |
| Less than | `expr1 < expr2` | Comparison (compatible types) |
| Less than or equal | `expr1 <= expr2` | -- |
| Greater than | `expr1 > expr2` | -- |
| Greater than or equal | `expr1 >= expr2` | -- |
| Equal | `expr1 == expr2` | -- |
| Not equal | `expr1 != expr2` | -- |
| Atan2 | `natan2(expr1, expr2)` | 2-argument arctangent function |
| Power | `npow(expr, constant)` | Exponentiation |
| Square root | `nsqrt(expr)` | Square root function |
| Exponent | `nexp(expr)` | Exponent function |
| Logarithm | `nlog(expr)` | Natural logarithm function |
| Base-10 logarithm | `nlog10(expr)` | Base-10 logarithm function |
| Sine | `nsin(expr)` | Sine function |
| Cosine | `ncos(expr)` | Cosine function |
| Tangent | `ntan(expr)` | Tangent function |
| Arcsine | `nasin(expr)` | Arcsine function |
| Arccosine | `nacos(expr)` | Arccosine function |
| Arctangent | `natan(expr)` | Arctangent function |
| Absolute value | `nabs(expr)` | Absolute value function |
| Rounding | `nround(expr)` | Rounding function |
| Bitwise "not" | `bitwise_not(expr)` | Bitwise logical "not" (integers) |
| Branching condition | `ifnode(expr_condition, expr_if_true, expr_if_false)` | Conditional expression, depending if the `expr_condition` is true or false, the result of this expression is either `expr_if_true` or `expr_if_false` |

Note that while normal function can be used in expressions, they are **evaluated on construction** so they are equivalent to numerical constants. All the specially declared functions that can act on `expressions` start with `n`.

Check failure on line 93 in docs/basics-tasks/FilteringPartitioning.md

View workflow job for this annotation

GitHub Actions / PR formatting / whitespace

Trailing spaces

Remove the trailing spaces at the end of the line.

You can specify multiple filters which will be applied in a sequence effectively resulting in the intersection of all them.

You can also specify filters on associated quantities:

```cpp
struct MyTask : AnalysisTask {
Filter<Collisions> collisionFilter = max(track::pt) > 1;

void process(Collisions const &filteredCollisions) {
for (auto& collision: filteredCollisions) {
...
}
}
};
```

will process all the collisions which have at least one track with `pt > 1`.

## Partitioning your inputs

Expand All @@ -75,21 +106,25 @@
Partition<Tracks> rightTracks = track::eta >= 0;

void process(Tracks const &tracks) {
for (auto& left : leftTracks(tracks)) {
for (auto& right : rightTracks(tracks)) {
for (auto& left : leftTracks) {
for (auto& right : rightTracks) {
...
}
}
for (auto& track : tracks) {
...
}
}
};
```

i.e. `Filter` is applied to the objects before passing them to the `process` method, while `Select` objects can be used to do further reduction inside the `process` method itself.
`Filter` is applied to the objects before passing them to the `process` method, while `Partitions` exist independently.

## Filtering and partitioning together

It is also possible to filter and partition data in the same task. Therefore, multiple `Filter`s are combined using the logical `AND`. These filters then are combined by a logical `AND` with all the specified selections `Select`, which themself are combined by logical `OR`s. E.g., `(Filter1 && Filter2) && (Select1 || Select2)`.
It is possible to use filters and partition data in the same task. Filters are applied as usual, combined with logical "and", therefore `tracks` table will only have entries that satisfy the filter conditions. The partitions, however, are _independent_ and _not grouped_ (by collision, in this example).

Check failure on line 125 in docs/basics-tasks/FilteringPartitioning.md

View workflow job for this annotation

GitHub Actions / PR formatting / whitespace

Trailing spaces

Remove the trailing spaces at the end of the line.

One can also define a partition _over a filtered type_, `Partition<soa::Filtered<aod::Tracks>> part = nabs(track::eta) < 1.f`. Doing this will put the partition selection on top of whatever filter selections are also present in the same task.
```cpp
using namespace o2::aod;

Expand All @@ -111,22 +146,17 @@

## Configuring filters

One of the features of the current framework is the ability to customize on the fly cuts and selection. The idea is to allow that by having a `configurable("mnemonic-name-of-the-parameter")` helper which can be used to refer to configurable options. The previous example will then become:
One of the features of the current framework is the ability to customize on the fly cuts and selection. Single-valued configurabled can be used in filter expressions directly, with some caveats. Configurables are defined to be implicitly convertable to their underlying type, however you do not want that to happen in your filter expressions as it would substitute its default value. It is possible to use the `.node()` method of the configurable in the expression, to ensure that it creates a placeholder node.

```cpp
struct MyTask : AnalysisTask {
Filter<Collisions> collisionFilter = max(track::pt) > configurable<float>("my-pt-cut");
Configurable<float> myPtCut{"my-pt-cut", 10.f, "pt cut"};
Filter trackFilter = track::pt >= myPtCut;

void process(Collision const& collision, soa::Filtered<Tracks> const& filteredTracks) {

void process(Collisions const &filteredCollisions) {
for (auto& collision: filteredCollisions) {
...
}
}
};
```

```todo
- Complete list of methods related to filters and partitions
```

See also tutorials [Data Selection](../tutorials/dataSelection.md).
Loading