maleadt approved these changes on Mar 19, 2025
Codecov Report

All modified and coverable lines are covered by tests ✅

Additional details and impacted files

Coverage Diff: master vs. #2701

| | master | #2701 | +/- |
|---|---|---|---|
| Coverage | 83.47% | 83.51% | +0.03% |
| Files | 153 | 153 | |
| Lines | 13592 | 13592 | |
| Hits | 11346 | 11351 | +5 |
| Misses | 2246 | 2241 | -5 |
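As a rough cross-check of the coverage figures above, the sketch below recomputes them from the reported hit and miss counts. It assumes that coverage is hits divided by lines and that the percentage is truncated (not rounded) to two decimals. The truncation is only an inference from these particular numbers, not documented Codecov behavior, and the function name is hypothetical.

```python
import math


def coverage_percent(hits: int, lines: int) -> float:
    """Coverage as a percentage, truncated to two decimals.

    Truncation is an assumption inferred from the report's figures.
    """
    return math.floor(hits / lines * 10000) / 100


# Figures copied from the coverage diff: (hits, misses, lines).
master = (11346, 2246, 13592)
pr_2701 = (11351, 2241, 13592)

for label, (hits, misses, lines) in (("master", master), ("#2701", pr_2701)):
    # Every coverable line is either hit or missed.
    assert hits + misses == lines
    print(label, coverage_percent(hits, lines))
```

Under these assumptions the script reproduces the 83.47% and 83.51% shown in the report.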
CUDA.jl Benchmarks
| Benchmark suite | Current: a541398 | Previous: 7fdd9e0 | Ratio |
|---|---|---|---|
| latency/precompile | 45939251851 ns | 45683028153.5 ns | 1.01 |
| latency/ttfp | 6483314063.5 ns | 6565374818 ns | 0.99 |
| latency/import | 3162349927 ns | 3199458367.5 ns | 0.99 |
| integration/volumerhs | 9624320 ns | 9625606 ns | 1.00 |
| integration/byval/slices=1 | 147187 ns | 146708 ns | 1.00 |
| integration/byval/slices=3 | 425371 ns | 425185 ns | 1.00 |
| integration/byval/reference | 145130 ns | 145038 ns | 1.00 |
| integration/byval/slices=2 | 286240 ns | 285862 ns | 1.00 |
| integration/cudadevrt | 103446 ns | 103315 ns | 1.00 |
| kernel/indexing | 14045 ns | 14074 ns | 1.00 |
| kernel/indexing_checked | 14994 ns | 14777.5 ns | 1.01 |
| kernel/occupancy | 642.2321428571429 ns | 673.5333333333333 ns | 0.95 |
| kernel/launch | 2139.777777777778 ns | 2091.2 ns | 1.02 |
| kernel/rand | 16809 ns | 15853.5 ns | 1.06 |
| array/reverse/1d | 19677 ns | 19669 ns | 1.00 |
| array/reverse/2d | 23806 ns | 23622 ns | 1.01 |
| array/reverse/1d_inplace | 10114 ns | 9923 ns | 1.02 |
| array/reverse/2d_inplace | 11882.5 ns | 11511 ns | 1.03 |
| array/copy | 21239 ns | 21077 ns | 1.01 |
| array/iteration/findall/int | 157297 ns | 157492 ns | 1.00 |
| array/iteration/findall/bool | 138353 ns | 138370 ns | 1.00 |
| array/iteration/findfirst/int | 154054 ns | 154134 ns | 1.00 |
| array/iteration/findfirst/bool | 154790 ns | 155050.5 ns | 1.00 |
| array/iteration/scalar | 71484 ns | 71906 ns | 0.99 |
| array/iteration/logical | 213262.5 ns | 213074 ns | 1.00 |
| array/iteration/findmin/1d | 41488 ns | 41644.5 ns | 1.00 |
| array/iteration/findmin/2d | 94149 ns | 94043.5 ns | 1.00 |
| array/reductions/reduce/1d | 35363 ns | 35292 ns | 1.00 |
| array/reductions/reduce/2d | 51340 ns | 51061 ns | 1.01 |
| array/reductions/mapreduce/1d | 33252 ns | 33511 ns | 0.99 |
| array/reductions/mapreduce/2d | 42047.5 ns | 41204 ns | 1.02 |
| array/broadcast | 20751 ns | 20524 ns | 1.01 |
| array/copyto!/gpu_to_gpu | 11912 ns | 11637 ns | 1.02 |
| array/copyto!/cpu_to_gpu | 207956 ns | 207801 ns | 1.00 |
| array/copyto!/gpu_to_cpu | 244278 ns | 244455 ns | 1.00 |
| array/accumulate/1d | 109117 ns | 109057 ns | 1.00 |
| array/accumulate/2d | 79661 ns | 79689 ns | 1.00 |
| array/construct | 1281.5 ns | 1275.1 ns | 1.01 |
| array/random/randn/Float32 | 43319 ns | 43977.5 ns | 0.99 |
| array/random/randn!/Float32 | 26508 ns | 26667 ns | 0.99 |
| array/random/rand!/Int64 | 27276 ns | 27133 ns | 1.01 |
| array/random/rand!/Float32 | 8726 ns | 8642.333333333334 ns | 1.01 |
| array/random/rand/Int64 | 30002 ns | 29842.5 ns | 1.01 |
| array/random/rand/Float32 | 13089 ns | 12971 ns | 1.01 |
| array/permutedims/4d | 60897 ns | 60954 ns | 1.00 |
| array/permutedims/2d | 55325 ns | 54987.5 ns | 1.01 |
| array/permutedims/3d | 56147.5 ns | 55653 ns | 1.01 |
| array/sorting/1d | 2777748 ns | 2775426 ns | 1.00 |
| array/sorting/by | 3368664 ns | 3366795.5 ns | 1.00 |
| array/sorting/2d | 1085341 ns | 1084590 ns | 1.00 |
| cuda/synchronization/stream/auto | 1026.2 ns | 1034.8 ns | 0.99 |
| cuda/synchronization/stream/nonblocking | 6481.9 ns | 6443 ns | 1.01 |
| cuda/synchronization/stream/blocking | 836.5721649484535 ns | 802.8020833333334 ns | 1.04 |
| cuda/synchronization/context/auto | 1165 ns | 1172.6 ns | 0.99 |
| cuda/synchronization/context/nonblocking | 6835 ns | 6645.8 ns | 1.03 |
| cuda/synchronization/context/blocking | 940.3055555555555 ns | 954.2142857142857 ns | 0.99 |
This comment was automatically generated by workflow using github-action-benchmark.
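
For readers scanning the benchmark table, the Ratio column appears to be the current timing divided by the previous one, rounded to two decimals, so values above 1.00 mean the current commit was slower. The sketch below reproduces that arithmetic for three rows copied from the table and flags rows above a cutoff. The function names and the 1.05 cutoff are hypothetical, chosen only for illustration, and are not taken from the workflow's configuration.

```python
def ratio(current_ns: float, previous_ns: float) -> float:
    """Current timing divided by previous timing; > 1.0 means slower."""
    return current_ns / previous_ns


def slower_than(rows: dict, threshold: float) -> list:
    """Names of benchmarks whose ratio exceeds the threshold, sorted."""
    return sorted(
        name
        for name, (current_ns, previous_ns) in rows.items()
        if ratio(current_ns, previous_ns) > threshold
    )


# (current ns, previous ns) pairs copied from the benchmark table.
rows = {
    "latency/precompile": (45939251851, 45683028153.5),
    "kernel/occupancy": (642.2321428571429, 673.5333333333333),
    "kernel/rand": (16809, 15853.5),
}

# Hypothetical cutoff, for illustration only.
THRESHOLD = 1.05

for name, (current_ns, previous_ns) in rows.items():
    print(f"{name}: {ratio(current_ns, previous_ns):.2f}")

print("above threshold:", slower_than(rows, THRESHOLD))
```

With this cutoff only kernel/rand is flagged. Differences of a few percent on microsecond-scale benchmarks can be run-to-run noise, so a flagged row is a prompt to rerun rather than proof of a regression.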
No description provided.