Optimize rcutils_logging_get_logger_effective_level() #381

clalancette · 2022-08-29T13:51:32Z

This function is the most expensive part of
rcutils_logging_logger_is_enabled_for(), which is called on every
RCUTILS_LOG_* invocation.

We notice a couple of things:

The current name -> logger level lookup is using a "string map"
structure. A "string map" isn't a map at all, but is really a linear
array of strings. Thus searching it can be slow when there are
a lot of strings in the map. Since this is a global map, this
can happen when many ROS 2 nodes are loaded in to the same process.
When looking up the severity of a particular name, we need to
check the full name, plus any ancestors (separated by '.') to find
the severity level. This requires a bunch of work, including
copying a string, running strlen on it, searching through it for
periods, etc. This can be expensive, and in the common case it isn't
needed; the fully-qualified string is usually in the map already.

To fix both of these, we switch to a hash map, and look up the
current logger level via the hash map. If the full logger name
isn't in the hash map, we fall back to the slow path where we
do the additional work of finding the severity via the hierarchy.
Note that even in this slow path, once we've computed the severity
level we place that in the map as well so that subsequent lookups
will take the fast path.

The benchmarks I ran on this change give very good improvements
for most cases. For cases where there are multiple set logger
names, and those logger names have ancestors, this change is
~9x faster than the current implementation. For cases where there
are multiple set logger names and those logger names have no
ancestors (one of the most common cases in ROS 2), this change is ~3x
faster than the current implementation. For cases where we
have a single logger name (another common case), this change is
~2x faster than the current implementation.

Finally, note that rcutils_logging_set_logger_level() is
more expensive than before, due to having to traverse the hash map
to find other logger names in the hierarchy. Since we don't expect
users to change logger levels often, this tradeoff seems worth it
to speed up lookups (which are called for every log message).

Signed-off-by: Chris Lalancette [email protected]

This should solve #364 (at least, as much as it can be solved given the current design).

Signed-off-by: Chris Lalancette <[email protected]>

This makes the code shorter and much more consistent. Signed-off-by: Chris Lalancette <[email protected]>

This function is the most expensive part of rcutils_logging_logger_is_enabled_for(), which is called on every RCUTILS_LOG_* invocation. We notice a couple of things: 1. The current name -> logger level lookup is using a "string map" structure. A "string map" isn't a map at all, but is really a linear array of strings. Thus searching it can be slow when there are a lot of strings in the map. Since this is a global map, this can happen when many ROS 2 nodes are loaded in to the same process. 2. When looking up the severity of a particular name, we need to check the full name, plus any ancestors (separated by '.') to find the severity level. This requires a bunch of work, including copying a string, running strlen on it, searching through it for periods, etc. This can be expensive, and in the common case it isn't needed; the fully-qualified string is usually in the map already. To fix both of these, we switch to a hash map, and look up the current logger level via the hash map. If the full logger name isn't in the hash map, we fall back to the slow path where we do the additional work of finding the severity via the hierarchy. Note that even in this slow path, once we've computed the severity level we place that in the map as well so that subsequent lookups will take the fast path. The benchmarks I ran on this change give very good improvements for most cases. For cases where there are multiple set logger names, and those logger names have ancestors, this change is ~9x faster than the current implementation. For cases where there are multiple set logger names and those logger names have no ancestors (one of the most common cases in ROS 2), this change is ~3x faster than the current implementation. For cases where we have a single logger name (another common case), this change is ~2x faster than the current implementation. Finally, note that rcutils_logging_set_logger_level() is more expensive than before, due to having to traverse the hash map to find other logger names in the hierarchy. Since we don't expect users to change logger levels often, this tradeoff seems worth it to speed up lookups (which are called for every log message). Signed-off-by: Chris Lalancette <[email protected]>

clalancette · 2022-08-29T13:52:28Z

CI:

Linux
Linux-aarch64
Windows

Also add in a test. Signed-off-by: Chris Lalancette <[email protected]>

clalancette · 2022-08-29T18:49:51Z

New CI after the latest fix:

Linux
Linux-aarch64
Windows

Signed-off-by: Chris Lalancette <[email protected]>

clalancette · 2022-08-30T01:05:11Z

Another CI with the latest Windows fix:

Linux
Linux-aarch64
Windows

clalancette · 2022-08-30T12:56:53Z

@ros-pull-request-builder retest this please

ivanpauno · 2022-09-01T18:34:06Z

I have a few questions before reviewing

When looking up the severity of a particular name, we need to
check the full name, plus any ancestors (separated by '.') to find
the severity level. This requires a bunch of work, including
copying a string, running strlen on it, searching through it for
periods, etc. This can be expensive, and in the common case it isn't
needed; the fully-qualified string is usually in the map already.

The parent logging severity is only queried if there wasn't a severity directly set for this logger, right?
If looking up for a parent severity is needed, it would be nice to set the child logger severity it in the map, so the next time the only thing needed is a query.
The only tricky thing is that if someone changes the parent logger severity, the child should also change in that case (as the severity was actually inherited from the parent), so to make sure that happens we will need to store some extra information.

IMO, everything would be more efficient if we would have a logger object, instead of only using a name.
It may be a pain to change that now, but most people are using the rclcpp/rclpy logging functions instead of this, and those already have a logger object.

clalancette · 2022-09-01T19:58:52Z

The parent logging severity is only queried if there wasn't a severity directly set for this logger, right?

That's correct.

If looking up for a parent severity is needed, it would be nice to set the child logger severity it in the map, so the next time the only thing needed is a query.

Yep! That's what this PR does now; if the parent lookup is needed, then once it is calculated the child is added into the map for subsequent lookups (see the code in rcutils_logging_get_logger_effective_level).

The only tricky thing is that if someone changes the parent logger severity, the child should also change in that case (as the severity was actually inherited from the parent), so to make sure that happens we will need to store some extra information.

Right. This PR also addresses that by walking the entire hash map when loggers change severity, looking for children of the current name being changed. For any ones that haven't been explicitly set, it drops them from the hash map until they are used again (see the code in rcutils_logging_set_logger_level).

IMO, everything would be more efficient if we would have a logger object, instead of only using a name.
It may be a pain to change that now, but most people are using the rclcpp/rclpy logging functions instead of this, and those already have a logger object.

Yes, agreed. The performance with this PR is the best we can do (I think) with a strictly string-based hierarchy like we currently have. I think it is valuable to get this in (as it will help performance today), and then something like ros2/rcl_logging#92 may make this obsolete.

ivanpauno

LGTM!

I have some minor comments

src/logging.c

Signed-off-by: Chris Lalancette <[email protected]>

clalancette · 2022-09-06T12:54:20Z

@ivanpauno Thanks for the review! I think I've addressed all of your comments; please take another look.

ivanpauno

LGTM! Great work!!

clalancette · 2022-09-06T17:53:15Z

One more CI on the latest changes:

Linux
Linux-aarch64
Windows

clalancette · 2022-09-07T15:20:07Z

The Windows CI warning is unrelated to this change, and everything else is green.

Going ahead and merging this, thanks for the review!

clalancette added 3 commits August 28, 2022 19:56

Add in tests for changing ancestor severity level.

34724ab

Signed-off-by: Chris Lalancette <[email protected]>

Revamp error handling in parse_and_create_handlers_list.

28fe626

This makes the code shorter and much more consistent. Signed-off-by: Chris Lalancette <[email protected]>

clalancette linked an issue Aug 29, 2022 that may be closed by this pull request

High logging overhead #364

Closed

Fix a bug introduced by the optimizations.

585eb74

Also add in a test. Signed-off-by: Chris Lalancette <[email protected]>

Fixes for Windows warnings.

0a49cdb

Signed-off-by: Chris Lalancette <[email protected]>

ivanpauno approved these changes Sep 5, 2022

View reviewed changes

src/logging.c Outdated Show resolved Hide resolved

src/logging.c Show resolved Hide resolved

src/logging.c Show resolved Hide resolved

clalancette added 2 commits September 6, 2022 12:47

Fix up error checking for tokens.

14b87ab

Signed-off-by: Chris Lalancette <[email protected]>

Add in a note about the LSB of the logging levels.

1575773

Signed-off-by: Chris Lalancette <[email protected]>

ivanpauno approved these changes Sep 6, 2022

View reviewed changes

clalancette merged commit 1ed4d7f into rolling Sep 7, 2022

clalancette deleted the clalancette/rcutils-get-logger-level-optimize branch September 7, 2022 15:20

clalancette mentioned this pull request Oct 12, 2022

Added initial version of logging.md file. ros2/design#315

Open

wjwwood mentioned this pull request Oct 14, 2022

regression of thread-safety for logging macros #393

Merged

clalancette mentioned this pull request Oct 26, 2022

Make logging functionality truly thread-safe #397

Open

jrutgeer mentioned this pull request Feb 17, 2023

Superfluous va_copy call in rcutils_char_array_vsprintf #409

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize rcutils_logging_get_logger_effective_level() #381

Optimize rcutils_logging_get_logger_effective_level() #381

clalancette commented Aug 29, 2022

clalancette commented Aug 29, 2022

clalancette commented Aug 29, 2022

clalancette commented Aug 30, 2022

clalancette commented Aug 30, 2022

ivanpauno commented Sep 1, 2022

clalancette commented Sep 1, 2022

ivanpauno left a comment

clalancette commented Sep 6, 2022

ivanpauno left a comment

clalancette commented Sep 6, 2022 •

edited

Loading

clalancette commented Sep 7, 2022

Optimize rcutils_logging_get_logger_effective_level() #381

Optimize rcutils_logging_get_logger_effective_level() #381

Conversation

clalancette commented Aug 29, 2022

clalancette commented Aug 29, 2022

clalancette commented Aug 29, 2022

clalancette commented Aug 30, 2022

clalancette commented Aug 30, 2022

ivanpauno commented Sep 1, 2022

clalancette commented Sep 1, 2022

ivanpauno left a comment

Choose a reason for hiding this comment

clalancette commented Sep 6, 2022

ivanpauno left a comment

Choose a reason for hiding this comment

clalancette commented Sep 6, 2022 • edited Loading

clalancette commented Sep 7, 2022

clalancette commented Sep 6, 2022 •

edited

Loading