Skip to content

More tests for some missing kron methods#2701

Merged
kshyatt merged 1 commit intomasterfrom
ksh/kron
Mar 19, 2025
Merged

More tests for some missing kron methods#2701
kshyatt merged 1 commit intomasterfrom
ksh/kron

Conversation

@kshyatt
Copy link
Copy Markdown
Member

@kshyatt kshyatt commented Mar 19, 2025

No description provided.

@kshyatt kshyatt added cuda libraries Stuff about CUDA library wrappers. tests Adds or changes tests. labels Mar 19, 2025
@codecov
Copy link
Copy Markdown

codecov Bot commented Mar 19, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 83.51%. Comparing base (7fdd9e0) to head (a541398).
Report is 3 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #2701      +/-   ##
==========================================
+ Coverage   83.47%   83.51%   +0.03%     
==========================================
  Files         153      153              
  Lines       13592    13592              
==========================================
+ Hits        11346    11351       +5     
+ Misses       2246     2241       -5     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copy link
Copy Markdown
Contributor

@github-actions github-actions Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

CUDA.jl Benchmarks

Details
Benchmark suite Current: a541398 Previous: 7fdd9e0 Ratio
latency/precompile 45939251851 ns 45683028153.5 ns 1.01
latency/ttfp 6483314063.5 ns 6565374818 ns 0.99
latency/import 3162349927 ns 3199458367.5 ns 0.99
integration/volumerhs 9624320 ns 9625606 ns 1.00
integration/byval/slices=1 147187 ns 146708 ns 1.00
integration/byval/slices=3 425371 ns 425185 ns 1.00
integration/byval/reference 145130 ns 145038 ns 1.00
integration/byval/slices=2 286240 ns 285862 ns 1.00
integration/cudadevrt 103446 ns 103315 ns 1.00
kernel/indexing 14045 ns 14074 ns 1.00
kernel/indexing_checked 14994 ns 14777.5 ns 1.01
kernel/occupancy 642.2321428571429 ns 673.5333333333333 ns 0.95
kernel/launch 2139.777777777778 ns 2091.2 ns 1.02
kernel/rand 16809 ns 15853.5 ns 1.06
array/reverse/1d 19677 ns 19669 ns 1.00
array/reverse/2d 23806 ns 23622 ns 1.01
array/reverse/1d_inplace 10114 ns 9923 ns 1.02
array/reverse/2d_inplace 11882.5 ns 11511 ns 1.03
array/copy 21239 ns 21077 ns 1.01
array/iteration/findall/int 157297 ns 157492 ns 1.00
array/iteration/findall/bool 138353 ns 138370 ns 1.00
array/iteration/findfirst/int 154054 ns 154134 ns 1.00
array/iteration/findfirst/bool 154790 ns 155050.5 ns 1.00
array/iteration/scalar 71484 ns 71906 ns 0.99
array/iteration/logical 213262.5 ns 213074 ns 1.00
array/iteration/findmin/1d 41488 ns 41644.5 ns 1.00
array/iteration/findmin/2d 94149 ns 94043.5 ns 1.00
array/reductions/reduce/1d 35363 ns 35292 ns 1.00
array/reductions/reduce/2d 51340 ns 51061 ns 1.01
array/reductions/mapreduce/1d 33252 ns 33511 ns 0.99
array/reductions/mapreduce/2d 42047.5 ns 41204 ns 1.02
array/broadcast 20751 ns 20524 ns 1.01
array/copyto!/gpu_to_gpu 11912 ns 11637 ns 1.02
array/copyto!/cpu_to_gpu 207956 ns 207801 ns 1.00
array/copyto!/gpu_to_cpu 244278 ns 244455 ns 1.00
array/accumulate/1d 109117 ns 109057 ns 1.00
array/accumulate/2d 79661 ns 79689 ns 1.00
array/construct 1281.5 ns 1275.1 ns 1.01
array/random/randn/Float32 43319 ns 43977.5 ns 0.99
array/random/randn!/Float32 26508 ns 26667 ns 0.99
array/random/rand!/Int64 27276 ns 27133 ns 1.01
array/random/rand!/Float32 8726 ns 8642.333333333334 ns 1.01
array/random/rand/Int64 30002 ns 29842.5 ns 1.01
array/random/rand/Float32 13089 ns 12971 ns 1.01
array/permutedims/4d 60897 ns 60954 ns 1.00
array/permutedims/2d 55325 ns 54987.5 ns 1.01
array/permutedims/3d 56147.5 ns 55653 ns 1.01
array/sorting/1d 2777748 ns 2775426 ns 1.00
array/sorting/by 3368664 ns 3366795.5 ns 1.00
array/sorting/2d 1085341 ns 1084590 ns 1.00
cuda/synchronization/stream/auto 1026.2 ns 1034.8 ns 0.99
cuda/synchronization/stream/nonblocking 6481.9 ns 6443 ns 1.01
cuda/synchronization/stream/blocking 836.5721649484535 ns 802.8020833333334 ns 1.04
cuda/synchronization/context/auto 1165 ns 1172.6 ns 0.99
cuda/synchronization/context/nonblocking 6835 ns 6645.8 ns 1.03
cuda/synchronization/context/blocking 940.3055555555555 ns 954.2142857142857 ns 0.99

This comment was automatically generated by workflow using github-action-benchmark.

@kshyatt kshyatt enabled auto-merge (squash) March 19, 2025 19:23
@kshyatt kshyatt merged commit 19a7ea2 into master Mar 19, 2025
3 checks passed
@kshyatt kshyatt deleted the ksh/kron branch March 19, 2025 22:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cuda libraries Stuff about CUDA library wrappers. tests Adds or changes tests.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants