Skip to content

Add implementation that supports large segments in cub::DeviceSegmentedTopK #8363

@elstehle

Description

@elstehle

The goal is to have an implementation that can process a batch of segments, where each segment may require multiple thread blocks to collaborate to compute the result

Tasks:

  • Write tests for large segments
  • Evaluate approach
  • ...

Depends on:
No dependencies.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    Status

    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions