backport `unreachable` #2852

davebayer · 2024-11-18T12:08:21Z

This PR backports cuda::std::unreachable to C++11, replacing old implementation (_CCCL_UNREACHABLE).

copy-pr-bot · 2024-11-18T12:08:24Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

libcudacxx/include/cuda/std/__utility/unreachable.h

libcudacxx/include/cuda/std/__cccl/unreachable.h

libcudacxx/include/cuda/std/detail/libcxx/include/array

libcudacxx/include/cuda/std/__utility/unreachable.h

miscco · 2024-11-18T12:54:29Z

It also fixes the implementation, because for device code the implementation was never right as __CUDA_ARCH__ is never defined outside device code.

Can you elaborate on this a bit. Outside of __CUDA_ARCH__ the macro was defined to just be __builtin_unreachable for non MSVC

davebayer · 2024-11-18T13:24:45Z

It also fixes the implementation, because for device code the implementation was never right as __CUDA_ARCH__ is never defined outside device code.

Can you elaborate on this a bit. Outside of __CUDA_ARCH__ the macro was defined to just be __builtin_unreachable for non MSVC

I didn't know NVHPC does not define __CUDA_ARCH__ in device code.. But I think at least for _CCCL_CUDACC_BELOW(11, 2) and _CCCL_CUDACC_BELOW(11, 3) the old implementation of unreachable should not work, because __CUDA_ARCH__ is never defined outside __device__ or __global__ scope for nvcc.

miscco · 2024-11-18T15:07:00Z

/ok to test

libcudacxx/include/cuda/std/__utility/unreachable.h

* Move implementation of `_LIBCUDACXX_TEMPLATE` to CCCL We have emulation for concepts in LIBCUDACXX that was guarded behind C++14 But there is nothing that requires C++14 for just the template headers and we want to use them universally throughout the codebase Consequently move them to CCCL proper and enable them unconditionally. To ensure that we do not add any hidden dependencies this also adds a barebones implementation of `enable_if_t` and a trailing `enable_if_t`

…ons (NVIDIA#2889) NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those

…DIA#2921) fixes NVIDIA#2362

Co-authored-by: Bernhard Manfred Gruber <[email protected]>

Co-authored-by: Michael Schellenberger Costa <[email protected]> Co-authored-by: Bernhard Manfred Gruber <[email protected]> Co-authored-by: Eric Niebler <[email protected]>

Co-authored-by: Allard Hendriksen <[email protected]>

* Improve build instructions for libcu++ * Add section about the options for the build script * Delegate more to the contributor guidelines

NVIDIA#2923) Co-authored-by: Michael Schellenberger Costa <[email protected]>

* Move implementation of `_LIBCUDACXX_TEMPLATE` to CCCL We have emulation for concepts in LIBCUDACXX that was guarded behind C++14 But there is nothing that requires C++14 for just the template headers and we want to use them universally throughout the codebase Consequently move them to CCCL proper and enable them unconditionally. To ensure that we do not add any hidden dependencies this also adds a barebones implementation of `enable_if_t` and a trailing `enable_if_t`

…DIA#2921) fixes NVIDIA#2362

Co-authored-by: Michael Schellenberger Costa <[email protected]> Co-authored-by: Bernhard Manfred Gruber <[email protected]> Co-authored-by: Eric Niebler <[email protected]>

NVIDIA#2923) Co-authored-by: Michael Schellenberger Costa <[email protected]>

Fixes: NVIDIA#2939

agent_histogram.cuh:827:37: warning: comparison of different enumeration types

* Drop unused macros * Do not return a custom struct * Replace `__MDSPAN_FOLD_AND` with `__fold_and_v` when possible

…VIDIA#2950) * Implement CUDASTF_DOT_TIMING facility for ctx.cuda_kernel * clang-format

Fixes [BUG]: UB in annotated_ptr NVIDIA#2942

This is adding the missing sm_100 identifier to nv/target Fixes NVIDIA#2890

Overwrites all generated PTX header and documentation files and runs `pre-commit run --all-files`. Also exclude generated PTX headers from header check.

Overwrites all generated PTX tests and runs `pre-commit run --all-files`

…NVIDIA#2956) Fixes NVIDIA#2933

* Implement copy_bytes for mdspan * Add final conversion to mdspan and more tests * mdspan fill_bytes * Add docs * Fix issues after rebase * Help old GCC figure out the types * Move runtime extents check to a function * Fix clang and more old GCC fixes

davebayer added 2 commits November 18, 2024 13:00

backport and fix unreachable

fdc6c36

remove _CCCL_UNREACHABLE

0246df1

davebayer requested review from a team as code owners November 18, 2024 12:08

davebayer requested review from wmaxey and gonidelis November 18, 2024 12:08

davebayer requested a review from bernhardmgruber November 18, 2024 12:08

update tests

0d83892

bernhardmgruber reviewed Nov 18, 2024

View reviewed changes

libcudacxx/include/cuda/std/__utility/unreachable.h Show resolved Hide resolved

davebayer added 2 commits November 18, 2024 13:25

define cccl library unreachable version

f6f2574

use _CCCL_NORETURN instead of standard [[noreturn]]

94101f2

miscco requested changes Nov 18, 2024

View reviewed changes

davebayer added 2 commits November 18, 2024 14:00

use _CUDA_VSTD within libcu++

4f0c516

fix for cuda compilation with NVHPC

a8e3f60

fix compilation for host/device unreachable

bed27be

miscco reviewed Nov 18, 2024

View reviewed changes

libcudacxx/include/cuda/std/__utility/unreachable.h Outdated Show resolved Hide resolved

miscco reviewed Nov 18, 2024

View reviewed changes

libcudacxx/include/cuda/std/__utility/unreachable.h Outdated Show resolved Hide resolved

separate host and device implementations

86e4ce8

miscco requested changes Nov 19, 2024

View reviewed changes

libcudacxx/include/cuda/std/__utility/unreachable.h Outdated Show resolved Hide resolved

libcudacxx/include/cuda/std/__utility/unreachable.h Show resolved Hide resolved

davebayer and others added 2 commits November 19, 2024 21:57

fall back to the original implementation

f7affcf

Merge branch 'main' into backport_and_fix_unreachable

3208ba5

davebayer requested a review from miscco November 19, 2024 21:18

davebayer changed the title ~~backport and fix unreachable~~ backport unreachable Nov 19, 2024

miscco and others added 29 commits November 22, 2024 16:23

Try to work around issue with NVHPC in conjunction of older CTK versi…

786d442

…ons (NVIDIA#2889) NVHPC can consume older CTK headers for stdpar, so we need to try and avoid using those

Refactoring (NVIDIA#2905)

6b5fa22

add "interface" to _CCCL_PUSH_MACROS (NVIDIA#2919)

44f0331

Replace inconsistent Doxygen macros with _CCCL_DOXYGEN_INVOKED (NVI…

c9a6e6a

…DIA#2921) fixes NVIDIA#2362

implement C++26 std::span::at (NVIDIA#2924)

a50019d

Co-authored-by: Bernhard Manfred Gruber <[email protected]>

move msvc compiler macros to new version (NVIDIA#2885)

d0f5bd2

Co-authored-by: Michael Schellenberger Costa <[email protected]> Co-authored-by: Bernhard Manfred Gruber <[email protected]> Co-authored-by: Eric Niebler <[email protected]>

Reorganize PTX tests to match generator (NVIDIA#2930)

5b57a4c

Reorganize PTX docs to match generator (NVIDIA#2929)

96e8199

Co-authored-by: Allard Hendriksen <[email protected]>

Improve build instructions for libcu++ (NVIDIA#2881)

92a22f2

* Improve build instructions for libcu++ * Add section about the options for the build script * Delegate more to the contributor guidelines

Reorganize PTX headers to match generator (NVIDIA#2925)

14484a6

implement C++26 std::span's constructor from std::initializer_list (

e897f1f

NVIDIA#2923) Co-authored-by: Michael Schellenberger Costa <[email protected]>

Add tuple protocol to cuda::std::complex from C++26 (NVIDIA#2882)

3b23083

Replace inconsistent Doxygen macros with _CCCL_DOXYGEN_INVOKED (NVI…

3911b25

…DIA#2921) fixes NVIDIA#2362

move msvc compiler macros to new version (NVIDIA#2885)

b760c7f

Co-authored-by: Michael Schellenberger Costa <[email protected]> Co-authored-by: Bernhard Manfred Gruber <[email protected]> Co-authored-by: Eric Niebler <[email protected]>

implement C++26 std::span's constructor from std::initializer_list (

8ee0b18

NVIDIA#2923) Co-authored-by: Michael Schellenberger Costa <[email protected]>

Add missing qualifier for cuda namespace (NVIDIA#2940)

ef9ae72

Fixes: NVIDIA#2939

Try to fix a clang warning: (NVIDIA#2941)

5e4d6e0

agent_histogram.cuh:827:37: warning: comparison of different enumeration types

minor consistency improvements in concepts macros (NVIDIA#2928)

6dbeb46

Drop some of the mdspan fold implementation (NVIDIA#2949)

2a47168

* Drop unused macros * Do not return a custom struct * Replace `__MDSPAN_FOLD_AND` with `__fold_and_v` when possible

[STF] Implement CUDASTF_DOT_TIMING for the ctx.cuda_kernel construct (N…

a4477a4

…VIDIA#2950) * Implement CUDASTF_DOT_TIMING facility for ctx.cuda_kernel * clang-format

Avoid potential null dereference in annotated_ptr (NVIDIA#2951)

09db35f

Fixes [BUG]: UB in annotated_ptr NVIDIA#2942

make compiler version comparison utility generic (NVIDIA#2952)

8ca5380

Add SM100 descriptor to target (NVIDIA#2954)

3c6fcd2

This is adding the missing sm_100 identifier to nv/target Fixes NVIDIA#2890

Regenerate cuda::ptx headers/docs and run format (NVIDIA#2937)

aededb9

Overwrites all generated PTX header and documentation files and runs `pre-commit run --all-files`. Also exclude generated PTX headers from header check.

Regenerate PTX test (NVIDIA#2953)

4d53204

Overwrites all generated PTX tests and runs `pre-commit run --all-files`

Do not include extended floating point headers if they are not needed (…

bdff820

…NVIDIA#2956) Fixes NVIDIA#2933

davebayer requested a review from a team as a code owner November 25, 2024 21:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

backport `unreachable` #2852

backport `unreachable` #2852

davebayer commented Nov 18, 2024 •

edited

Loading

copy-pr-bot bot commented Nov 18, 2024

miscco commented Nov 18, 2024 •

edited

Loading

davebayer commented Nov 18, 2024 •

edited

Loading

miscco commented Nov 18, 2024

backport unreachable #2852

Are you sure you want to change the base?

backport unreachable #2852

Conversation

davebayer commented Nov 18, 2024 • edited Loading

copy-pr-bot bot commented Nov 18, 2024

miscco commented Nov 18, 2024 • edited Loading

davebayer commented Nov 18, 2024 • edited Loading

miscco commented Nov 18, 2024

backport `unreachable` #2852

backport `unreachable` #2852

davebayer commented Nov 18, 2024 •

edited

Loading

miscco commented Nov 18, 2024 •

edited

Loading

davebayer commented Nov 18, 2024 •

edited

Loading