Annotate kernels separately by michaelmckinsey1 · Pull Request #695 · llnl/RAJAPerf

michaelmckinsey1 · 2026-06-24T23:09:18Z

Summary

This PR is additional Caliper regions for RAJAPerf kernels that have multiple kernel launches.
It does the following (modify list as needed):
- Adds additional Caliper regions at the request of @pearce8
There is currently no synchronize for these regions for asynchronous GPU kernels. But these should be profiled with CUDA/HIP events anyway, which do not need CPU synchronization to measure GPU time.
add function types to be able to filter out kernels instead of type=function -> type=subkernel

Examples

Polybench_JACOBI_1D has one launch per rep for poly_jacobi_1D_1 and one for poly_jacobi_1D_2. So its tree will now profile each separately:

POLYBENCH_FLOYD_WARSHALL, HALO_PACKING, HALO_EXCHANGE all have variable amount of launches, so we will append _k instead of adding k regions to the tree

All kernels with only 1 launch per rep are unchanged.

michaelmckinsey1 · 2026-06-26T21:05:52Z

made a draft because need to add different types to these regions so we can filter them with caliper

artv3 · 2026-06-29T01:09:32Z

+        RP_CALI_MARK_END(RP_CALI_REGION(ENERGY_1));

+        RP_CALI_MARK_BEGIN(RP_CALI_REGION(ENERGY_2));
        RAJA::forall< RAJA::cuda_exec<block_size, async> >( res,


@michaelmckinsey1 , why not use just use RAJA's kernel naming capability here? Example...

RAJA::forall<RAJA::cuda_exec<256>>( range, RAJA::Name("VectorAddKernel"), // <-- Kernel Name injected here [=] RAJA_DEVICE (int i) { c[i] = a[i] + b[i]; } );

michaelmckinsey1 added 4 commits June 24, 2026 15:53

Annotate kernels as example

08bf9eb

Change names

9a0ee0b

add more

7dd29bc

Full JACOBI_1D example

7b1d348

michaelmckinsey1 self-assigned this Jun 24, 2026

michaelmckinsey1 added 2 commits June 24, 2026 17:11

All applicable kernels

7309ab0

Refactor and _k for variable regions

ddb24c5

michaelmckinsey1 requested a review from pearce8 June 26, 2026 19:49

michaelmckinsey1 changed the title ~~[WIP] Annotate kernels separately~~ Annotate kernels separately Jun 26, 2026

michaelmckinsey1 marked this pull request as ready for review June 26, 2026 19:49

michaelmckinsey1 marked this pull request as draft June 26, 2026 21:04

artv3 reviewed Jun 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Annotate kernels separately#695

Annotate kernels separately#695
michaelmckinsey1 wants to merge 6 commits into
developfrom
multi-kernel-regions

michaelmckinsey1 commented Jun 24, 2026 •

edited

Loading

Uh oh!

michaelmckinsey1 commented Jun 26, 2026

Uh oh!

artv3 Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

michaelmckinsey1 commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Examples

Uh oh!

michaelmckinsey1 commented Jun 26, 2026

Uh oh!

artv3 Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

michaelmckinsey1 commented Jun 24, 2026 •

edited

Loading