Add additional MPS Solve tests.#858
Merged
Merged
Conversation
Contributor
There was a problem hiding this comment.
Metal Benchmarks
Details
| Benchmark suite | Current: 912aa50 | Previous: 70bf387 | Ratio |
|---|---|---|---|
array/accumulate/Float32/1d |
584792 ns |
518125 ns |
1.13 |
array/accumulate/Float32/dims=1 |
572062.5 ns |
499417 ns |
1.15 |
array/accumulate/Float32/dims=1L |
8845959 ns |
8940917 ns |
0.99 |
array/accumulate/Float32/dims=2 |
633042 ns |
530750 ns |
1.19 |
array/accumulate/Float32/dims=2L |
3330000 ns |
2812354.5 ns |
1.18 |
array/accumulate/Int64/1d |
886375 ns |
927479.5 ns |
0.96 |
array/accumulate/Int64/dims=1 |
1129916.5 ns |
1113792 ns |
1.01 |
array/accumulate/Int64/dims=1L |
9604125 ns |
9788208 ns |
0.98 |
array/accumulate/Int64/dims=2 |
1455750 ns |
1462749.5 ns |
1.00 |
array/accumulate/Int64/dims=2L |
6574604.5 ns |
7004770.5 ns |
0.94 |
array/broadcast |
369000 ns |
335709 ns |
1.10 |
array/construct |
3500 ns |
3416 ns |
1.02 |
array/permutedims/2d |
626334 ns |
622375 ns |
1.01 |
array/permutedims/3d |
1102125 ns |
1096521 ns |
1.01 |
array/permutedims/4d |
1698375 ns |
1149229.5 ns |
1.48 |
array/private/copy |
419375 ns |
396208 ns |
1.06 |
array/private/copyto!/cpu_to_gpu |
383958.5 ns |
361333.5 ns |
1.06 |
array/private/copyto!/gpu_to_cpu |
381625 ns |
366375 ns |
1.04 |
array/private/copyto!/gpu_to_gpu |
359208 ns |
329458 ns |
1.09 |
array/private/iteration/findall/bool |
1147250 ns |
1189209 ns |
0.96 |
array/private/iteration/findall/int |
1280083 ns |
1321250 ns |
0.97 |
array/private/iteration/findfirst/bool |
1398209 ns |
1317041 ns |
1.06 |
array/private/iteration/findfirst/int |
1427583.5 ns |
1386333.5 ns |
1.03 |
array/private/iteration/findmin/1d |
1563208 ns |
1532542 ns |
1.02 |
array/private/iteration/findmin/2d |
1260917 ns |
1233042 ns |
1.02 |
array/private/iteration/logical |
1829167 ns |
1986750 ns |
0.92 |
array/private/iteration/scalar |
2837354 ns |
1774791 ns |
1.60 |
array/random/rand/Float32 |
613834 ns |
601750 ns |
1.02 |
array/random/rand/Int64 |
625417 ns |
646729 ns |
0.97 |
array/random/rand!/Float32 |
527750 ns |
502750 ns |
1.05 |
array/random/rand!/Int64 |
495875 ns |
473708 ns |
1.05 |
array/random/randn/Float32 |
569458 ns |
553375 ns |
1.03 |
array/random/randn!/Float32 |
480000 ns |
459166 ns |
1.05 |
array/reductions/mapreduce/Float32/1d |
704250 ns |
612312.5 ns |
1.15 |
array/reductions/mapreduce/Float32/dims=1 |
484875 ns |
445208 ns |
1.09 |
array/reductions/mapreduce/Float32/dims=1L |
686541.5 ns |
702083 ns |
0.98 |
array/reductions/mapreduce/Float32/dims=2 |
496020.5 ns |
449354.5 ns |
1.10 |
array/reductions/mapreduce/Float32/dims=2L |
1338833 ns |
907875 ns |
1.47 |
array/reductions/mapreduce/Int64/1d |
891708 ns |
807417 ns |
1.10 |
array/reductions/mapreduce/Int64/dims=1 |
774250 ns |
761583.5 ns |
1.02 |
array/reductions/mapreduce/Int64/dims=1L |
1080542 ns |
1100209 ns |
0.98 |
array/reductions/mapreduce/Int64/dims=2 |
932583.5 ns |
799291 ns |
1.17 |
array/reductions/mapreduce/Int64/dims=2L |
2170500 ns |
2169333 ns |
1.00 |
array/reductions/reduce/Float32/1d |
702125 ns |
610833 ns |
1.15 |
array/reductions/reduce/Float32/dims=1 |
484625 ns |
446334 ns |
1.09 |
array/reductions/reduce/Float32/dims=1L |
686167 ns |
703354.5 ns |
0.98 |
array/reductions/reduce/Float32/dims=2 |
399709 ns |
351666 ns |
1.14 |
array/reductions/reduce/Float32/dims=2L |
484500 ns |
490667 ns |
0.99 |
array/reductions/reduce/Int64/1d |
895625 ns |
802375 ns |
1.12 |
array/reductions/reduce/Int64/dims=1 |
773375 ns |
761375 ns |
1.02 |
array/reductions/reduce/Int64/dims=1L |
1074854.5 ns |
1096042 ns |
0.98 |
array/reductions/reduce/Int64/dims=2 |
931417 ns |
795333 ns |
1.17 |
array/reductions/reduce/Int64/dims=2L |
2151208 ns |
2151083 ns |
1.00 |
array/shared/copy |
152833 ns |
162500 ns |
0.94 |
array/shared/copyto!/cpu_to_gpu |
39917 ns |
39542 ns |
1.01 |
array/shared/copyto!/gpu_to_cpu |
39958 ns |
40667 ns |
0.98 |
array/shared/copyto!/gpu_to_gpu |
40458 ns |
43708.5 ns |
0.93 |
array/shared/iteration/findall/bool |
1149083.5 ns |
1197209 ns |
0.96 |
array/shared/iteration/findall/int |
1285500 ns |
1329500 ns |
0.97 |
array/shared/iteration/findfirst/bool |
1118645.5 ns |
1036750 ns |
1.08 |
array/shared/iteration/findfirst/int |
1150479.5 ns |
1012208.5 ns |
1.14 |
array/shared/iteration/findmin/1d |
1298812.5 ns |
1286104.5 ns |
1.01 |
array/shared/iteration/findmin/2d |
1264542 ns |
1235166 ns |
1.02 |
array/shared/iteration/logical |
1647583 ns |
1796791.5 ns |
0.92 |
array/shared/iteration/scalar |
3635.375 ns |
3463.5 ns |
1.05 |
integration/byval/reference |
1136208 ns |
1153792 ns |
0.98 |
integration/byval/slices=1 |
1137083 ns |
1156167 ns |
0.98 |
integration/byval/slices=2 |
2048292 ns |
2082104.5 ns |
0.98 |
integration/byval/slices=3 |
7074916.5 ns |
18075459 ns |
0.39 |
integration/metaldevrt |
441167 ns |
439875 ns |
1.00 |
kernel/indexing |
356292 ns |
335042 ns |
1.06 |
kernel/indexing_checked |
531417 ns |
495250 ns |
1.07 |
kernel/launch |
1966.7 ns |
1975 ns |
1.00 |
kernel/rand |
522791 ns |
500604.5 ns |
1.04 |
latency/import |
1768942375 ns |
1791687916 ns |
0.99 |
latency/precompile |
36941127624.5 ns |
37168987104 ns |
0.99 |
latency/ttfp |
2095601334 ns |
2129036000 ns |
0.98 |
metal/synchronization/context |
529.3626943005181 ns |
527.6373056994819 ns |
1.00 |
metal/synchronization/stream |
278.9512987012987 ns |
275.8242811501597 ns |
1.01 |
This comment was automatically generated by workflow using github-action-benchmark.
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #858 +/- ##
==========================================
+ Coverage 86.61% 86.67% +0.05%
==========================================
Files 76 76
Lines 5134 5134
==========================================
+ Hits 4447 4450 +3
+ Misses 687 684 -3 ☔ View full report in Codecov by Harness. 🚀 New features to boost your workflow:
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
From #145