fix(agent): Additional checks needed before iterating backwards in opcodes. #639

zsistla · 2023-03-13T20:46:28Z

nr_php_observer_attempt_call_cufa_handler is a new function with oapi. The equivalent function for non-oapi is in php_vm.c. In that case, it has been called while we are in the middle of php_execute and can assume certain things among them being we can just check the execute_data opcodes and that the start opcode is ZEND_DO_FCALL (of which we know certain facts) and iterate back as needed.

In legacy, to determine determine this is a call_user_func_array() call we have to look at the previous opcodes of zend_execute_data.
In oapi, to determine if this is a call_user_func_array() call we have to look at the previous opcodes of zend_execute_data->prev_execute_data.
For both we know:
ZEND_DO_FCALL will never be the first opcode in an op array -- minimally, there is always at least a ZEND_INIT_FCALL before it -- so it is safe to iterate backwards in the opcode like execute_data->prev_execute_data->opline - 1

In oapi, the path to the equivalent function didn't have a guaranteed existence of zend_execute_data and therefore was causing some segfaults (specifically noticed for PHP 8.2 on wordpress which makes extensive use of cufa for hook implementation). Additionally, we are not guaranteed that the prev_execute_data opcode is ZEND_DO_FCALL so we can't blindly iterate backwards on it.

This PR does two things:

Check if execute_data is NULL before attempting to access its elements.
verify the opcode is ZEND_DO_FCALL before we iterate back to determine if it was called from call_user_func_array.

Testing:
Build from source soak tests for PHP 8.2 and wordpress no longer segfault.

Since our logic depends on what we know about ZEND_DO_FCALL, verify this is actually a ZEND_DO_FCALL otherwise exit.

zsistla · 2023-03-14T14:50:36Z

ok jenkins

lavarou

@zsistla I would appreciate if you could you add some context to the description of this PR. I.e. what code path caused the execute_data to be NULL and under which circumstances this function was called when ZEND_DO_FCALL != execute_data->prev_execute_data->opline->opcode? This would be most helpful to understand the need for these changes.

agent/php_execute.c

zsistla · 2023-03-14T15:27:37Z

@zsistla I would appreciate if you could you add some context to the description of this PR. I.e. what code path caused the execute_data to be NULL and under which circumstances this function was called when ZEND_DO_FCALL != execute_data->prev_execute_data->opline->opcode? This would be most helpful to understand the need for these changes.

Please see updated description.

zsistla · 2023-03-14T15:42:57Z

ok jenkins

zsistla · 2023-03-14T16:31:22Z

ok jenkins

zsistla · 2023-03-14T18:07:08Z

ok jenkins

zsistla · 2023-03-14T20:04:49Z

ok jenkins

lavarou

  if (ZEND_DO_FCALL != execute_data->prev_execute_data->opline->opcode) {
    return;
  }

is the key in this PR and addresses the segfault, caused by OAPI's instrumentation of user functions called via call_user_function_array which assumes that DO_FCALL is the opcode of the previous call frame. This needs to be ensured and that extra check does it. Nice find!

agent/php_execute.c

Added a message when execute_data is NULL and removed the check for execute_data->opline as we don't use that value.

lavarou · 2023-03-17T14:31:38Z

Here's a suggestion I have that will improve code review experience: apply a coding style with clang-format only to new code added to an existing code base. This will reduce 'background noise' and help reviewers keep focused on the new/changed code. Here's how this can be added to the code/commit workflow:

Disable automatic execution of clang-format on save in your editor!!!
Code like there's no tomorrow to add new features or fix bugs
git add
git clang-format
git status
git add <files formatted in step 4.>
git commit

This could be semi-automated with git hook: https://github.com/barisione/clang-format-hooks/#using-the-pre-commit-hook.

lavarou · 2023-08-08T17:44:49Z

Obsolete. Superseded by #708.

fix(agent): Check for execute_data for NULL before accessing.

6b1e032

zsistla added the ready for review label Mar 13, 2023

zsistla added this to the OAPI Instrumentation milestone Mar 13, 2023

zsistla requested review from ZNeumann, mfulb, lavarou and bduranleau-nr March 13, 2023 20:46

zsistla changed the title ~~fix(agent): Check for execute_data for NULL before accessing.~~ fix(agent): Check execute_data for NULL before accessing. Mar 13, 2023

fix(agent): Ensure ZEND_DO_FCALL.

b44b95f

Since our logic depends on what we know about ZEND_DO_FCALL, verify this is actually a ZEND_DO_FCALL otherwise exit.

lavarou reviewed Mar 14, 2023

View reviewed changes

agent/php_execute.c Show resolved Hide resolved

agent/php_execute.c Show resolved Hide resolved

agent/php_execute.c Show resolved Hide resolved

agent/php_execute.c Show resolved Hide resolved

zsistla changed the title ~~fix(agent): Check execute_data for NULL before accessing.~~ fix(agent): Additional checks needed before iterating backwards in opcodes. Mar 14, 2023

lavarou approved these changes Mar 14, 2023

View reviewed changes

agent/php_execute.c Outdated Show resolved Hide resolved

ZNeumann approved these changes Mar 17, 2023

View reviewed changes

fix(agent): Add message when execute_data is NULL.

b8052af

Added a message when execute_data is NULL and removed the check for execute_data->opline as we don't use that value.

mfulb approved these changes Mar 22, 2023

View reviewed changes

bduranleau-nr approved these changes Mar 28, 2023

View reviewed changes

lavarou closed this Aug 8, 2023

lavarou mentioned this pull request Aug 8, 2023

fix: add guards to oapi cufa opcode check #708

Merged

zsistla deleted the cufa_segfault branch September 13, 2024 19:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): Additional checks needed before iterating backwards in opcodes. #639

fix(agent): Additional checks needed before iterating backwards in opcodes. #639

zsistla commented Mar 13, 2023 •

edited

Loading

zsistla commented Mar 14, 2023

lavarou left a comment

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

lavarou left a comment

lavarou commented Mar 17, 2023 •

edited

Loading

lavarou commented Aug 8, 2023

fix(agent): Additional checks needed before iterating backwards in opcodes. #639

fix(agent): Additional checks needed before iterating backwards in opcodes. #639

Conversation

zsistla commented Mar 13, 2023 • edited Loading

zsistla commented Mar 14, 2023

lavarou left a comment

Choose a reason for hiding this comment

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

zsistla commented Mar 14, 2023

lavarou left a comment

Choose a reason for hiding this comment

lavarou commented Mar 17, 2023 • edited Loading

lavarou commented Aug 8, 2023

zsistla commented Mar 13, 2023 •

edited

Loading

lavarou commented Mar 17, 2023 •

edited

Loading