mrc-5048: Add request ID into logs #38

r-ash · 2024-02-22T10:08:30Z

@plietar added a request_id to the Rust API in mrc-ide/outpack_server#58. If a request is received with an x-request-id header, that ID is included in the log messages; if no header is sent, a request ID is generated and included in the logs instead.

This PR adds the same pattern so we can use it across all our R APIs. Hopefully, with this in place, if we send a request from the Kotlin backend to any of our services, we can trace it through using this ID.

This PR will:

Update the GitHub Actions workflows (these were a fair bit out of date)
If the request includes an x-request-id header, return it in the response header. If none is sent, generate a UUID.
If a logger is being used, add the request ID to all log messages
If the API author uses the logger which porcelain configures (i.e. the logger with name equal to the package name), the request ID is also added to all logs from it
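
The header handling in the list above can be sketched roughly as follows. This is a minimal illustration and not the code in this PR: `generate_request_id`, `resolve_request_id` and `mock_response` are hypothetical names, the generated ID is only UUID-shaped (a real implementation would more likely use a UUID package), and the request and response are mocked with plain environments so the sketch runs with base R alone. The fields `HTTP_X_REQUEST_ID`, `REQUEST_ID` and the `setHeader` call are taken from the diff and discussion in this PR.

```r
# Hypothetical helper: build a random UUID-shaped string with base R only.
generate_request_id <- function() {
  hex <- sample(c(0:9, letters[1:6]), 32, replace = TRUE)
  groups <- split(hex, rep(1:5, times = c(8, 4, 4, 4, 12)))
  paste(vapply(groups, paste, character(1), collapse = ""), collapse = "-")
}

# Hypothetical filter body: reuse the incoming ID if present, otherwise
# generate one, then store it on the request and echo it in the response.
resolve_request_id <- function(req, res) {
  request_id <- req$HTTP_X_REQUEST_ID
  if (is.null(request_id)) {
    request_id <- generate_request_id()
  }
  req$REQUEST_ID <- request_id
  res$setHeader("x-request-id", request_id)
  invisible(request_id)
}

# Mock response object (an assumption, standing in for the real response).
mock_response <- function() {
  res <- new.env()
  res$headers <- list()
  res$setHeader <- function(name, value) {
    res$headers[[name]] <- value
  }
  res
}

# A request that already carries an ID keeps it.
req <- new.env()
req$HTTP_X_REQUEST_ID <- "abc-123"
res <- mock_response()
resolve_request_id(req, res)
res$headers[["x-request-id"]]  # "abc-123"

# A request without the header gets a freshly generated ID.
req_no_header <- new.env()
res_no_header <- mock_response()
generated <- resolve_request_id(req_no_header, res_no_header)
```

Either way the caller can read the ID back from the response header and use it to search the logs.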

r-ash · 2024-02-22T10:10:24Z

R/filter.R

@@ -1,5 +1,6 @@
-porcelain_filters <- function(req, res) {


These req, res args were not being used

r-ash · 2024-02-22T10:12:27Z

inst/examples/add.chatty/R/api.R

@@ -0,0 +1,19 @@
+add <- function(a, b) {
+  logger <- lgr::get_logger("add.chatty")


Note that users will have to get the logger themselves if they want to use the request ID (and everything else configured by porcelain). They could also pass it through from where the logger is created in the API. We could add a function to make this easier.
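
One possible shape for such a function, as a rough sketch only: `get_api_logger` is a hypothetical name and not part of porcelain. It assumes the logger is named after the package, as described in this PR, and uses `utils::packageName()` to find that name from the calling code, so it only works without an explicit `name` when called from inside a package.

```r
# Hypothetical helper, not part of porcelain: fetch the logger named after
# the calling package, or after an explicitly supplied name.
get_api_logger <- function(name = utils::packageName(parent.frame())) {
  if (is.null(name)) {
    stop("Could not work out a package name; please pass 'name' explicitly")
  }
  lgr::get_logger(name)
}

# Inside the add.chatty package this could then be called without arguments:
add <- function(a, b) {
  logger <- get_api_logger()
  logger$info("Adding numbers")
  a + b
}
```

Outside a package, for example in an interactive session, the name would need to be passed explicitly: `get_api_logger("add.chatty")`.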

r-ash · 2024-02-22T10:14:20Z

tests/testthat/test-roxygen.R

@@ -99,7 +99,7 @@ test_that("Nice error on parse failure", {
  err <- expect_error(
    roxygen2::parse_text(text))
  expect_match(
-    err$message,
+    err$parent$message,


I think this has changed due to the porcelain update


codecov · 2024-02-22T10:56:57Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (18511c4) to head (6c35bd6).

Additional details and impacted files

@@            Coverage Diff            @@
##            master       #38   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           19        19           
  Lines          975       992   +17     
=========================================
+ Hits           975       992   +17


plietar

Very nice. Just some minor points

R/filter.R

tests/testthat/test-input.R

Co-authored-by: Paul Liétar <[email protected]>

M-Kusumgar

LGTM, tiny optional comment

R/filter.R

Co-authored-by: M-Kusumgar <[email protected]>

richfitz · 2024-02-22T11:55:12Z

R/logging.R

@@ -1,3 +1,5 @@
+LOG_FILTER_REQUEST_ID_NAME <- "request_id" # nolint


The advantage of this constant seems reduced by (I'm guessing) it needing to match the name used in FilterInject. You could use the .list argument there and do

FilterInject$new(.list = set_names(request_id, LOG_FILTER_REQUEST_ID_NAME))

which would reuse this constant at least

richfitz · 2024-02-22T11:59:56Z

R/filter.R

+    req$REQUEST_ID <- request_id
+    res$setHeader("x-request-id", request_id)
+    if (!is.null(logger)) {
+      logger$add_filter(lgr::FilterInject$new(request_id = request_id),


This interaction with lgr feels like it's going to have performance issues each time (based on my previous experience with the package). Each request we'll be doing

add_filter

FilterInject creation

remove_filter

I presume this means that when the request comes through we're checking to see "do we have the appropriate header" then if so adding additional data to the log output? Any reason this can't be done directly in porcelain_log_postserialize where we have req already? It's possible I'm missing something here though.

OTOH, it might be worth looking to see how bad this is performance wise - see if it's actually an issue with a quick benchmark with and without the header being present and see how long a roundtrip takes?

r-ash · 2024-02-28

We chatted about this in person, but I'm putting this down here for future reference.

We can't just use porcelain_log_postserialize, because we want to add the request ID to the logger itself, so that if a user writes a log call in their API implementation it picks up the request ID.

With plumber we have 4 hooks we can use: preroute, postroute, preserialize and postserialize. In preroute we don't have a handle on "HTTP_X_REQUEST_ID", so we need to do this after preroute, but the actual endpoint code is run before we get to postroute. So we have to handle this in a filter.
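
The ordering argument can be illustrated with a toy model. This is not plumber code: `simulate_request` is a hypothetical function, the stage names are just labels, and the assumption that the header only becomes usable from the filter stage onwards is taken from the comment above. It shows which injection point lets a log call made inside the endpoint see the request ID.

```r
# Toy model of the request lifecycle described above (not plumber itself).
# inject_at: the stage at which we try to put the request ID on the logger.
simulate_request <- function(inject_at) {
  stages <- c("preroute", "filter", "endpoint", "postroute")
  header_available <- FALSE
  injected_id <- NULL
  endpoint_log <- NULL
  for (stage in stages) {
    # Assumption from the discussion: the header is usable from filters on.
    if (stage == "filter") {
      header_available <- TRUE
    }
    if (stage == inject_at && header_available) {
      injected_id <- "abc-123"
    }
    # The endpoint logs with whatever ID the logger holds at that moment.
    if (stage == "endpoint") {
      shown <- if (is.null(injected_id)) "NA" else injected_id
      endpoint_log <- sprintf("[request_id=%s] adding numbers", shown)
    }
  }
  endpoint_log
}

simulate_request("preroute")   # "[request_id=NA] adding numbers"
simulate_request("filter")     # "[request_id=abc-123] adding numbers"
simulate_request("postroute")  # "[request_id=NA] adding numbers"
```

Only the filter stage is both late enough to see the header and early enough to affect logs written by the endpoint.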

I did some benchmarking on this, by running up the API in a background process with the add.chatty package I added in this PR. There isn't a huge amount of variation in timings

With no filter

rashton@wpia-dide270:~/projects/porcelain$ ./timeit curl -w "@curl_format.txt" -o /dev/null -s "http://localhost:8552/?a=3&b=3"
Average: 0.00569371 N: 1000

With the filter which creates and removes the object every time

rashton@wpia-dide270:~/projects/porcelain$ ./timeit curl -w "@curl_format.txt" -o /dev/null -s "http://localhost:9120/?a=3&b=3"
Average: 0.0055903 N: 1000

With the updated filter, which just changes the request ID on the filter object that exists in the cache

rashton@wpia-dide270:~/projects/porcelain$ ./timeit curl -w "@curl_format.txt" -o /dev/null -s "http://localhost:9343/?a=3&b=3"
Average: 0.00546013 N: 1000

So the average times are very similar, with less difference than I was expecting. It feels nice not to have to create the object every time, so I have gone with that, but I think the previous implementation read a little more simply, so we can always revert.
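
The difference between the two filter approaches benchmarked above can be sketched with a toy stand-in. This is an illustration under stated assumptions and not the code in this PR: `new_inject_filter`, `format_event`, `handle_with_new_filter`, `handle_with_cached_filter` and `cache` are all hypothetical names, and a plain environment stands in for lgr's FilterInject so the example runs with base R only.

```r
# Toy stand-in for a filter object that injects a request ID into log events.
new_inject_filter <- function() {
  filter <- new.env()
  filter$values <- list(request_id = NULL)
  filter
}

# Format a log line using whatever request ID the filter currently holds.
format_event <- function(msg, filter) {
  id <- filter$values$request_id
  sprintf("[request_id=%s] %s", if (is.null(id)) "NA" else id, msg)
}

# Approach A: build a fresh filter object for every request.
handle_with_new_filter <- function(request_id, msg) {
  filter <- new_inject_filter()
  filter$values$request_id <- request_id
  format_event(msg, filter)
}

# Approach B: create the filter once, keep it in a cache, and only update
# the request ID it holds on each request.
cache <- new.env()
handle_with_cached_filter <- function(request_id, msg) {
  if (is.null(cache$filter)) {
    cache$filter <- new_inject_filter()
  }
  cache$filter$values$request_id <- request_id
  format_event(msg, cache$filter)
}

first <- handle_with_cached_filter("id-1", "hello")
filter_after_first <- cache$filter
second <- handle_with_cached_filter("id-2", "hello again")
# The same filter object is reused; only the ID it carries has changed.
same_object <- identical(filter_after_first, cache$filter)
```

Both approaches produce the same log lines; the cached version avoids constructing a new object per request at the cost of holding mutable state between requests.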

r-ash added 2 commits February 19, 2024 18:17

Add filter to add request ID into logs

0c54dfc

Add request ID into response header

c227123

Add request ID to logger filter in plumber filter

6a0ac9d

So that the request ID is available to logs whilst endpoint is being run

r-ash force-pushed the mrc-5048 branch from 76e3a0e to 6a0ac9d Compare February 22, 2024 10:14

r-ash requested review from richfitz and M-Kusumgar February 22, 2024 10:16

r-ash added 5 commits February 22, 2024 10:20

Ignore identation lintr issues

5c42ea9

Update github actions

ea2f883

Fix lint issues

6b68e1e

Add missing import

1d1bea2

Update roxygen, fix test failures

a78a81e

plietar reviewed Feb 22, 2024


r-ash and others added 3 commits February 22, 2024 13:57

Update tests/testthat/test-input.R

f433bf1

Co-authored-by: Paul Liétar <[email protected]>

Update tests/testthat/test-input.R

2ca231c

Co-authored-by: Paul Liétar <[email protected]>

Use unpacked HTTP header instead of raw headers on the request

5b58544

plietar approved these changes Feb 22, 2024


M-Kusumgar approved these changes Feb 22, 2024


Use util to improve code quality

2b7b77b

Co-authored-by: M-Kusumgar <[email protected]>

richfitz reviewed Feb 27, 2024


r-ash added 2 commits February 28, 2024 16:30

Avoid creating the filter object every time

5bc5742

Make logger required

6c35bd6

r-ash requested a review from richfitz February 29, 2024 10:43

richfitz approved these changes Feb 29, 2024


r-ash merged commit 655da35 into master Feb 29, 2024
10 checks passed
