Skip to content

[Bug] C++ API and core tests fail with "assert 3 == 0" across all GPUs and CUDA 13.x versions #4152

@apbose

Description

@apbose

Bug Description

C++ test suites //tests/core:core_tests and //tests:cpp_api_tests consistently fail with assert 3 == 0 across all tested GPU architectures (RTX 3070, B100-TS2, H100, A100) and all CUDA 13.x versions. The failure is 100% reproducible across every configuration, suggesting a systemic issue rather than a hardware- or driver-specific one.

Environment

  • GPUs: RTX 3070, B100-TS2, H100, A100
  • Arch: x86_64
  • CUDA: 13.2.0 / 13.1.1 / 13.0.2
  • OS: Ubuntu 24.04
  • cuDNN: 8.9.7.29
  • Myelin: 2.17.78+7
  • CASK: 5.16.17+1
  • Python: 3.12
  • Package: qa_tar_py3.12

Failing Tests

FAILED test_torch_trt_cpp[//tests/core:core_tests] - assert 3 == 0
FAILED test_torch_trt_cpp[//tests:cpp_api_tests] - assert 3 == 0

Reproducible Configurations

GPU CUDA Result
RTX 3070/x86_64 r13.0.2, r13.1.1, r13.2.0 FAILED
A100/x86_64 r13.0.2, r13.1.1, r13.2.0 FAILED
H100/x86_64 r13.1.1, r13.2.0 FAILED
B100-TS2/x86_64 r13.0.2, r13.1.1, r13.2.0 FAILED

All 22 logged test runs failed — no passing configuration observed.

Steps to Reproduce

  1. Run on any of the listed GPUs with CUDA 13.x and the environment above
  2. Execute the C++ test suites:
    • //tests/core:core_tests
    • //tests:cpp_api_tests

Expected Behavior

Both C++ test suites should pass with assert 0 == 0.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions