Conversation
Contributor
|
Your PR requires formatting changes to meet the project's style guidelines. Click here to view the suggested changes.diff --git a/test/core/pointer.jl b/test/core/pointer.jl
index 224fe2a24..f09fe31a3 100644
--- a/test/core/pointer.jl
+++ b/test/core/pointer.jl
@@ -84,7 +84,7 @@ end
@test sprint(show, ref) == "CuRefValue{Int64}(1)"
@test ref[] == 1
@test Base.unsafe_convert(CuPtr{Cvoid}, ref) isa CuPtr{Cvoid}
-
+
arr_ref = convert(CuRef{Int64}, CUDA.ones(Int64, 1))
@test eltype(arr_ref) == Int64
@test convert(CuRef{Int64}, arr_ref) === arr_ref |
maleadt
approved these changes
Mar 20, 2025
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #2706 +/- ##
==========================================
+ Coverage 86.63% 86.69% +0.06%
==========================================
Files 153 153
Lines 13116 13116
==========================================
+ Hits 11363 11371 +8
+ Misses 1753 1745 -8 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
Contributor
There was a problem hiding this comment.
CUDA.jl Benchmarks
Details
| Benchmark suite | Current: 1e40aa2 | Previous: ab74bfc | Ratio |
|---|---|---|---|
latency/precompile |
46033279481 ns |
45609312670.5 ns |
1.01 |
latency/ttfp |
6494214097.5 ns |
6509804251 ns |
1.00 |
latency/import |
3167430533.5 ns |
3165547030 ns |
1.00 |
integration/volumerhs |
9624632 ns |
9618874 ns |
1.00 |
integration/byval/slices=1 |
147038 ns |
146985 ns |
1.00 |
integration/byval/slices=3 |
425334 ns |
425068 ns |
1.00 |
integration/byval/reference |
145124 ns |
145136 ns |
1.00 |
integration/byval/slices=2 |
286522 ns |
286134 ns |
1.00 |
integration/cudadevrt |
103512 ns |
103387 ns |
1.00 |
kernel/indexing |
14286 ns |
14093 ns |
1.01 |
kernel/indexing_checked |
14757 ns |
14974 ns |
0.99 |
kernel/occupancy |
686.4701986754967 ns |
665.62893081761 ns |
1.03 |
kernel/launch |
2152.2 ns |
2199.6666666666665 ns |
0.98 |
kernel/rand |
14884 ns |
15117 ns |
0.98 |
array/reverse/1d |
19750 ns |
19957 ns |
0.99 |
array/reverse/2d |
25355 ns |
24198 ns |
1.05 |
array/reverse/1d_inplace |
10384.666666666666 ns |
10158 ns |
1.02 |
array/reverse/2d_inplace |
11195 ns |
11719 ns |
0.96 |
array/copy |
20798 ns |
21382 ns |
0.97 |
array/iteration/findall/int |
159025 ns |
157735 ns |
1.01 |
array/iteration/findall/bool |
138963.5 ns |
138994.5 ns |
1.00 |
array/iteration/findfirst/int |
154894 ns |
154611 ns |
1.00 |
array/iteration/findfirst/bool |
155577 ns |
155155 ns |
1.00 |
array/iteration/scalar |
71873 ns |
70419 ns |
1.02 |
array/iteration/logical |
217091 ns |
215597 ns |
1.01 |
array/iteration/findmin/1d |
42433 ns |
42227 ns |
1.00 |
array/iteration/findmin/2d |
94618 ns |
94292 ns |
1.00 |
array/reductions/reduce/1d |
44047.5 ns |
36180 ns |
1.22 |
array/reductions/reduce/2d |
47953 ns |
50906.5 ns |
0.94 |
array/reductions/mapreduce/1d |
42130 ns |
34242 ns |
1.23 |
array/reductions/mapreduce/2d |
45774 ns |
41685.5 ns |
1.10 |
array/broadcast |
21180 ns |
20673 ns |
1.02 |
array/copyto!/gpu_to_gpu |
11947 ns |
13667 ns |
0.87 |
array/copyto!/cpu_to_gpu |
211379.5 ns |
210128 ns |
1.01 |
array/copyto!/gpu_to_cpu |
246657 ns |
243984 ns |
1.01 |
array/accumulate/1d |
109213 ns |
109871 ns |
0.99 |
array/accumulate/2d |
80361 ns |
80363 ns |
1.00 |
array/construct |
1310.1 ns |
1295.6 ns |
1.01 |
array/random/randn/Float32 |
48171 ns |
49526 ns |
0.97 |
array/random/randn!/Float32 |
26561 ns |
26634 ns |
1.00 |
array/random/rand!/Int64 |
27243.5 ns |
27280 ns |
1.00 |
array/random/rand!/Float32 |
8769 ns |
8717.666666666666 ns |
1.01 |
array/random/rand/Int64 |
30080.5 ns |
30145 ns |
1.00 |
array/random/rand/Float32 |
13136 ns |
13266 ns |
0.99 |
array/permutedims/4d |
61274 ns |
61147 ns |
1.00 |
array/permutedims/2d |
56243 ns |
55994 ns |
1.00 |
array/permutedims/3d |
57608 ns |
56646 ns |
1.02 |
array/sorting/1d |
2776181 ns |
2777259 ns |
1.00 |
array/sorting/by |
3367546 ns |
3369070 ns |
1.00 |
array/sorting/2d |
1086416.5 ns |
1085925.5 ns |
1.00 |
cuda/synchronization/stream/auto |
1034 ns |
1011.6 ns |
1.02 |
cuda/synchronization/stream/nonblocking |
6654.8 ns |
6554.8 ns |
1.02 |
cuda/synchronization/stream/blocking |
848.5512820512821 ns |
833.1359223300971 ns |
1.02 |
cuda/synchronization/context/auto |
1213.7 ns |
1175.8 ns |
1.03 |
cuda/synchronization/context/nonblocking |
6784.4 ns |
6767.2 ns |
1.00 |
cuda/synchronization/context/blocking |
946.5333333333333 ns |
939.3666666666667 ns |
1.01 |
This comment was automatically generated by workflow using github-action-benchmark.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.