Skip to content

[Deprecation] Enact deprecations#1293

Merged
vmoens merged 2 commits intogh/vmoens/53/basefrom
gh/vmoens/53/head
Apr 23, 2025
Merged

[Deprecation] Enact deprecations#1293
vmoens merged 2 commits intogh/vmoens/53/basefrom
gh/vmoens/53/head

Conversation

@vmoens
Copy link
Copy Markdown
Collaborator

@vmoens vmoens commented Apr 23, 2025

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Apr 23, 2025
ghstack-source-id: af6b270
Pull Request resolved: #1293
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 23, 2025
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 23, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 233. Improved: $\large\color{#35bf28}20$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 26.3200μs 11.3951μs 87.7573 KOps/s 88.0551 KOps/s $\color{#d91a1a}-0.34\%$
test_plain_set_stack_nested 31.6610μs 11.4372μs 87.4343 KOps/s 87.1660 KOps/s $\color{#35bf28}+0.31\%$
test_plain_set_nested_inplace 34.9500μs 12.4234μs 80.4930 KOps/s 79.7088 KOps/s $\color{#35bf28}+0.98\%$
test_plain_set_stack_nested_inplace 42.1100μs 12.3941μs 80.6833 KOps/s 80.1076 KOps/s $\color{#35bf28}+0.72\%$
test_items 30.3710μs 2.8793μs 347.3037 KOps/s 341.7264 KOps/s $\color{#35bf28}+1.63\%$
test_items_nested 0.4057ms 0.3630ms 2.7548 KOps/s 2.7315 KOps/s $\color{#35bf28}+0.85\%$
test_items_nested_locked 0.4275ms 0.3647ms 2.7418 KOps/s 2.7362 KOps/s $\color{#35bf28}+0.20\%$
test_items_nested_leaf 97.9310μs 59.9788μs 16.6726 KOps/s 16.5685 KOps/s $\color{#35bf28}+0.63\%$
test_items_stack_nested 0.3997ms 0.3626ms 2.7579 KOps/s 2.7262 KOps/s $\color{#35bf28}+1.16\%$
test_items_stack_nested_leaf 95.1000μs 60.9177μs 16.4156 KOps/s 16.5655 KOps/s $\color{#d91a1a}-0.90\%$
test_items_stack_nested_locked 0.4340ms 0.3632ms 2.7536 KOps/s 2.6898 KOps/s $\color{#35bf28}+2.37\%$
test_keys 29.3300μs 3.4082μs 293.4140 KOps/s 288.1985 KOps/s $\color{#35bf28}+1.81\%$
test_keys_nested 0.1239ms 87.1001μs 11.4810 KOps/s 11.1904 KOps/s $\color{#35bf28}+2.60\%$
test_keys_nested_locked 2.7766ms 93.8630μs 10.6538 KOps/s 10.4800 KOps/s $\color{#35bf28}+1.66\%$
test_keys_nested_leaf 0.1113ms 78.8856μs 12.6766 KOps/s 12.4925 KOps/s $\color{#35bf28}+1.47\%$
test_keys_stack_nested 0.1390ms 87.0875μs 11.4827 KOps/s 11.2148 KOps/s $\color{#35bf28}+2.39\%$
test_keys_stack_nested_leaf 0.1149ms 78.2139μs 12.7855 KOps/s 12.4714 KOps/s $\color{#35bf28}+2.52\%$
test_keys_stack_nested_locked 0.1320ms 93.2623μs 10.7224 KOps/s 10.5497 KOps/s $\color{#35bf28}+1.64\%$
test_values 4.8618μs 0.8619μs 1.1603 MOps/s 1.1761 MOps/s $\color{#d91a1a}-1.34\%$
test_values_nested 67.2510μs 37.4061μs 26.7336 KOps/s 26.4073 KOps/s $\color{#35bf28}+1.24\%$
test_values_nested_locked 83.9010μs 39.2093μs 25.5041 KOps/s 25.0994 KOps/s $\color{#35bf28}+1.61\%$
test_values_nested_leaf 72.9710μs 42.3213μs 23.6288 KOps/s 23.1393 KOps/s $\color{#35bf28}+2.12\%$
test_values_stack_nested 80.5810μs 37.4293μs 26.7170 KOps/s 26.2310 KOps/s $\color{#35bf28}+1.85\%$
test_values_stack_nested_leaf 78.7910μs 42.6125μs 23.4673 KOps/s 23.2946 KOps/s $\color{#35bf28}+0.74\%$
test_values_stack_nested_locked 70.5210μs 39.2146μs 25.5007 KOps/s 25.1148 KOps/s $\color{#35bf28}+1.54\%$
test_membership 2.1720μs 0.4998μs 2.0008 MOps/s 1.9873 MOps/s $\color{#35bf28}+0.68\%$
test_membership_nested 54.5210μs 2.1358μs 468.2096 KOps/s 508.7406 KOps/s $\textbf{\color{#d91a1a}-7.97\%}$
test_membership_nested_leaf 17.7855μs 2.0493μs 487.9739 KOps/s 498.3709 KOps/s $\color{#d91a1a}-2.09\%$
test_membership_stacked_nested 25.6800μs 2.1250μs 470.5848 KOps/s 484.6620 KOps/s $\color{#d91a1a}-2.90\%$
test_membership_stacked_nested_leaf 39.0300μs 2.1073μs 474.5520 KOps/s 485.9128 KOps/s $\color{#d91a1a}-2.34\%$
test_membership_nested_last 34.2900μs 3.0986μs 322.7244 KOps/s 327.7213 KOps/s $\color{#d91a1a}-1.52\%$
test_membership_nested_leaf_last 43.2710μs 3.1272μs 319.7748 KOps/s 328.1324 KOps/s $\color{#d91a1a}-2.55\%$
test_membership_stacked_nested_last 36.6010μs 3.0730μs 325.4180 KOps/s 326.8815 KOps/s $\color{#d91a1a}-0.45\%$
test_membership_stacked_nested_leaf_last 31.9900μs 3.1027μs 322.2970 KOps/s 328.8532 KOps/s $\color{#d91a1a}-1.99\%$
test_nested_getleaf 0.6813ms 13.0455μs 76.6545 KOps/s 76.9756 KOps/s $\color{#d91a1a}-0.42\%$
test_nested_get 49.8810μs 12.4764μs 80.1512 KOps/s 80.5962 KOps/s $\color{#d91a1a}-0.55\%$
test_stacked_getleaf 49.1300μs 13.1517μs 76.0358 KOps/s 76.6277 KOps/s $\color{#d91a1a}-0.77\%$
test_stacked_get 56.6810μs 12.5046μs 79.9708 KOps/s 80.2443 KOps/s $\color{#d91a1a}-0.34\%$
test_nested_getitemleaf 44.4800μs 13.5180μs 73.9754 KOps/s 74.2135 KOps/s $\color{#d91a1a}-0.32\%$
test_nested_getitem 45.2210μs 12.8011μs 78.1181 KOps/s 78.0106 KOps/s $\color{#35bf28}+0.14\%$
test_stacked_getitemleaf 45.0510μs 13.4920μs 74.1178 KOps/s 74.3524 KOps/s $\color{#d91a1a}-0.32\%$
test_stacked_getitem 41.5600μs 12.7969μs 78.1437 KOps/s 78.6618 KOps/s $\color{#d91a1a}-0.66\%$
test_lock_nested 5.9592ms 0.3601ms 2.7773 KOps/s 2.7601 KOps/s $\color{#35bf28}+0.62\%$
test_lock_stack_nested 0.3956ms 0.3511ms 2.8480 KOps/s 2.8449 KOps/s $\color{#35bf28}+0.11\%$
test_unlock_nested 0.5153ms 0.2966ms 3.3721 KOps/s 3.3475 KOps/s $\color{#35bf28}+0.73\%$
test_unlock_stack_nested 0.3247ms 0.2905ms 3.4423 KOps/s 3.4389 KOps/s $\color{#35bf28}+0.10\%$
test_flatten_speed 0.1383ms 77.0199μs 12.9836 KOps/s 13.0238 KOps/s $\color{#d91a1a}-0.31\%$
test_unflatten_speed 0.5285ms 0.3951ms 2.5307 KOps/s 2.4787 KOps/s $\color{#35bf28}+2.10\%$
test_common_ops 0.8934ms 0.6437ms 1.5535 KOps/s 1.5509 KOps/s $\color{#35bf28}+0.17\%$
test_creation 88.8910μs 1.7382μs 575.3050 KOps/s 570.1392 KOps/s $\color{#35bf28}+0.91\%$
test_creation_empty 0.7435ms 7.1173μs 140.5032 KOps/s 139.3301 KOps/s $\color{#35bf28}+0.84\%$
test_creation_nested_1 0.1028ms 10.0146μs 99.8539 KOps/s 98.3598 KOps/s $\color{#35bf28}+1.52\%$
test_creation_nested_2 0.1024ms 12.9660μs 77.1250 KOps/s 77.3952 KOps/s $\color{#d91a1a}-0.35\%$
test_clone 81.7210μs 11.0393μs 90.5852 KOps/s 88.8025 KOps/s $\color{#35bf28}+2.01\%$
test_getitem[int] 0.1743ms 10.7534μs 92.9941 KOps/s 65.0422 KOps/s $\textbf{\color{#35bf28}+42.98\%}$
test_getitem[slice_int] 0.1113ms 21.2513μs 47.0559 KOps/s 46.3397 KOps/s $\color{#35bf28}+1.55\%$
test_getitem[range] 0.1378ms 38.0686μs 26.2684 KOps/s 25.6795 KOps/s $\color{#35bf28}+2.29\%$
test_getitem[tuple] 0.1050ms 18.4569μs 54.1802 KOps/s 53.4414 KOps/s $\color{#35bf28}+1.38\%$
test_getitem[list] 0.1981ms 34.2767μs 29.1743 KOps/s 29.3093 KOps/s $\color{#d91a1a}-0.46\%$
test_setitem_dim[int] 46.5510μs 20.0104μs 49.9740 KOps/s 49.8620 KOps/s $\color{#35bf28}+0.22\%$
test_setitem_dim[slice_int] 69.3200μs 40.9647μs 24.4113 KOps/s 25.5901 KOps/s $\color{#d91a1a}-4.61\%$
test_setitem_dim[range] 80.5110μs 53.2320μs 18.7857 KOps/s 18.4780 KOps/s $\color{#35bf28}+1.67\%$
test_setitem_dim[tuple] 68.0810μs 32.8810μs 30.4127 KOps/s 30.2308 KOps/s $\color{#35bf28}+0.60\%$
test_setitem 0.2883ms 16.1546μs 61.9020 KOps/s 61.4362 KOps/s $\color{#35bf28}+0.76\%$
test_set 0.2654ms 15.5413μs 64.3449 KOps/s 63.9551 KOps/s $\color{#35bf28}+0.61\%$
test_set_shared 0.5320ms 0.1607ms 6.2216 KOps/s 6.2070 KOps/s $\color{#35bf28}+0.23\%$
test_update 0.5236ms 18.6518μs 53.6141 KOps/s 51.9218 KOps/s $\color{#35bf28}+3.26\%$
test_update_nested 0.1284ms 29.2541μs 34.1833 KOps/s 33.5845 KOps/s $\color{#35bf28}+1.78\%$
test_update__nested 0.1119ms 25.9729μs 38.5017 KOps/s 38.9553 KOps/s $\color{#d91a1a}-1.16\%$
test_set_nested 0.1467ms 16.6465μs 60.0726 KOps/s 58.2993 KOps/s $\color{#35bf28}+3.04\%$
test_set_nested_new 0.1406ms 20.1417μs 49.6483 KOps/s 50.3342 KOps/s $\color{#d91a1a}-1.36\%$
test_select 0.1169ms 32.0980μs 31.1546 KOps/s 30.6094 KOps/s $\color{#35bf28}+1.78\%$
test_select_nested 80.3010μs 43.6157μs 22.9275 KOps/s 22.7276 KOps/s $\color{#35bf28}+0.88\%$
test_exclude_nested 0.1047ms 63.7080μs 15.6966 KOps/s 15.5603 KOps/s $\color{#35bf28}+0.88\%$
test_empty[True] 0.5500ms 0.2972ms 3.3650 KOps/s 3.3616 KOps/s $\color{#35bf28}+0.10\%$
test_empty[False] 2.3740μs 0.7986μs 1.2522 MOps/s 1.2089 MOps/s $\color{#35bf28}+3.58\%$
test_to 87.7310μs 57.4519μs 17.4059 KOps/s 17.4727 KOps/s $\color{#d91a1a}-0.38\%$
test_to_nonblocking 89.2410μs 50.5254μs 19.7920 KOps/s 20.0150 KOps/s $\color{#d91a1a}-1.11\%$
test_unbind_speed 0.7349ms 0.2445ms 4.0892 KOps/s 4.0566 KOps/s $\color{#35bf28}+0.80\%$
test_unbind_speed_stack0 0.3290ms 0.2456ms 4.0717 KOps/s 4.0428 KOps/s $\color{#35bf28}+0.72\%$
test_unbind_speed_stack1 91.9956ms 0.7446ms 1.3431 KOps/s 1.4591 KOps/s $\textbf{\color{#d91a1a}-7.95\%}$
test_split 93.1163ms 1.5971ms 626.1238 Ops/s 618.5423 Ops/s $\color{#35bf28}+1.23\%$
test_chunk 95.6443ms 1.6150ms 619.1793 Ops/s 616.4915 Ops/s $\color{#35bf28}+0.44\%$
test_consolidate[False-None] 94.7933ms 3.0224ms 330.8598 Ops/s 327.7772 Ops/s $\color{#35bf28}+0.94\%$
test_consolidate[default-None] 1.7763ms 1.6941ms 590.2772 Ops/s 593.5531 Ops/s $\color{#d91a1a}-0.55\%$
test_consolidate[reduce-overhead-None] 1.7781ms 1.7361ms 575.9994 Ops/s 582.3543 Ops/s $\color{#d91a1a}-1.09\%$
test_consolidate_njt[False-None] 6.7974ms 6.4912ms 154.0542 Ops/s 153.3339 Ops/s $\color{#35bf28}+0.47\%$
test_to[False-False-None] 1.8512ms 1.7710ms 564.6598 Ops/s 556.4661 Ops/s $\color{#35bf28}+1.47\%$
test_to[True-False-None] 1.9110ms 1.4250ms 701.7683 Ops/s 707.8925 Ops/s $\color{#d91a1a}-0.87\%$
test_to[within-False-None] 4.3712ms 4.3100ms 232.0175 Ops/s 230.6198 Ops/s $\color{#35bf28}+0.61\%$
test_to[True-default-None] 5.4611ms 5.2548ms 190.3037 Ops/s 188.4826 Ops/s $\color{#35bf28}+0.97\%$
test_to_njt[False-False-None] 7.0585ms 6.8905ms 145.1271 Ops/s 144.2890 Ops/s $\color{#35bf28}+0.58\%$
test_to_njt[True-False-None] 5.6243ms 5.4776ms 182.5623 Ops/s 181.3942 Ops/s $\color{#35bf28}+0.64\%$
test_to_njt[within-False-None] 0.3177s 15.7024ms 63.6844 Ops/s 82.4875 Ops/s $\textbf{\color{#d91a1a}-22.80\%}$
test_creation[device0] 0.5442ms 79.3172μs 12.6076 KOps/s 12.5679 KOps/s $\color{#35bf28}+0.32\%$
test_creation_from_tensor 0.4557ms 82.7791μs 12.0803 KOps/s 12.0019 KOps/s $\color{#35bf28}+0.65\%$
test_add_one[memmap_tensor0] 0.3519ms 6.9543μs 143.7966 KOps/s 144.4817 KOps/s $\color{#d91a1a}-0.47\%$
test_contiguous[memmap_tensor0] 2.2591μs 0.4070μs 2.4569 MOps/s 2.5099 MOps/s $\color{#d91a1a}-2.11\%$
test_stack[memmap_tensor0] 37.3110μs 4.4531μs 224.5639 KOps/s 218.0799 KOps/s $\color{#35bf28}+2.97\%$
test_memmaptd_index 1.4875ms 0.2401ms 4.1651 KOps/s 4.1591 KOps/s $\color{#35bf28}+0.14\%$
test_memmaptd_index_astensor 0.4259ms 0.3018ms 3.3136 KOps/s 3.2694 KOps/s $\color{#35bf28}+1.35\%$
test_memmaptd_index_op 0.9455ms 0.5568ms 1.7959 KOps/s 1.7745 KOps/s $\color{#35bf28}+1.21\%$
test_serialize_model 0.1327s 0.1320s 7.5748 Ops/s 7.5778 Ops/s $\color{#d91a1a}-0.04\%$
test_serialize_model_pickle 1.3496s 1.2144s 0.8235 Ops/s 0.8230 Ops/s $\color{#35bf28}+0.06\%$
test_serialize_weights 0.1320s 0.1312s 7.6207 Ops/s 7.5893 Ops/s $\color{#35bf28}+0.41\%$
test_serialize_weights_returnearly 0.3130s 53.0066ms 18.8656 Ops/s 23.7238 Ops/s $\textbf{\color{#d91a1a}-20.48\%}$
test_serialize_weights_pickle 1.3777s 1.2181s 0.8209 Ops/s 0.8212 Ops/s $\color{#d91a1a}-0.04\%$
test_reshape_pytree 59.5710μs 21.8974μs 45.6676 KOps/s 45.3627 KOps/s $\color{#35bf28}+0.67\%$
test_reshape_td 69.6910μs 26.4608μs 37.7917 KOps/s 37.5030 KOps/s $\color{#35bf28}+0.77\%$
test_view_pytree 0.4072ms 21.7304μs 46.0185 KOps/s 46.3674 KOps/s $\color{#d91a1a}-0.75\%$
test_view_td 79.4310μs 31.5882μs 31.6574 KOps/s 32.2365 KOps/s $\color{#d91a1a}-1.80\%$
test_unbind_pytree 0.4297ms 28.6356μs 34.9216 KOps/s 35.0026 KOps/s $\color{#d91a1a}-0.23\%$
test_unbind_td 0.6049ms 37.9012μs 26.3844 KOps/s 26.0809 KOps/s $\color{#35bf28}+1.16\%$
test_split_pytree 0.4127ms 29.8256μs 33.5282 KOps/s 32.4958 KOps/s $\color{#35bf28}+3.18\%$
test_split_td 0.8095ms 38.7740μs 25.7905 KOps/s 25.5666 KOps/s $\color{#35bf28}+0.88\%$
test_add_pytree 0.4289ms 35.1656μs 28.4369 KOps/s 28.4716 KOps/s $\color{#d91a1a}-0.12\%$
test_add_td 0.2821ms 49.7942μs 20.0827 KOps/s 19.9964 KOps/s $\color{#35bf28}+0.43\%$
test_compile_add_one_nested[tensordict-compile] 0.1890ms 0.1224ms 8.1670 KOps/s 7.8917 KOps/s $\color{#35bf28}+3.49\%$
test_compile_add_one_nested[tensordict-eager] 0.2385ms 0.1432ms 6.9821 KOps/s 6.9623 KOps/s $\color{#35bf28}+0.28\%$
test_compile_add_one_nested[pytree-compile] 0.1444ms 96.4811μs 10.3647 KOps/s 10.0236 KOps/s $\color{#35bf28}+3.40\%$
test_compile_add_one_nested[pytree-eager] 1.5847ms 0.1551ms 6.4469 KOps/s 6.1187 KOps/s $\textbf{\color{#35bf28}+5.36\%}$
test_compile_copy_nested[tensordict-compile] 60.7510μs 24.5390μs 40.7514 KOps/s 38.5137 KOps/s $\textbf{\color{#35bf28}+5.81\%}$
test_compile_copy_nested[tensordict-eager] 76.8910μs 34.8730μs 28.6755 KOps/s 27.9233 KOps/s $\color{#35bf28}+2.69\%$
test_compile_copy_nested[pytree-compile] 0.4157ms 64.0350μs 15.6165 KOps/s 15.4166 KOps/s $\color{#35bf28}+1.30\%$
test_compile_copy_nested[pytree-eager] 92.8310μs 49.3413μs 20.2670 KOps/s 20.4444 KOps/s $\color{#d91a1a}-0.87\%$
test_compile_add_one_flat[tensordict-compile] 0.1974ms 0.1417ms 7.0564 KOps/s 6.4796 KOps/s $\textbf{\color{#35bf28}+8.90\%}$
test_compile_add_one_flat[tensordict-eager] 0.6060ms 0.2217ms 4.5107 KOps/s 4.3971 KOps/s $\color{#35bf28}+2.58\%$
test_compile_add_one_flat[tensorclass-compile] 0.5061ms 96.8055μs 10.3300 KOps/s 9.6296 KOps/s $\textbf{\color{#35bf28}+7.27\%}$
test_compile_add_one_flat[tensorclass-eager] 0.5139ms 57.8448μs 17.2876 KOps/s 16.5717 KOps/s $\color{#35bf28}+4.32\%$
test_compile_add_one_flat[pytree-compile] 0.1982ms 0.1363ms 7.3384 KOps/s 6.8540 KOps/s $\textbf{\color{#35bf28}+7.07\%}$
test_compile_add_one_flat[pytree-eager] 0.9081ms 0.5093ms 1.9633 KOps/s 1.9254 KOps/s $\color{#35bf28}+1.97\%$
test_compile_add_self_flat[tensordict-eager] 0.6679ms 0.2674ms 3.7404 KOps/s 3.6151 KOps/s $\color{#35bf28}+3.46\%$
test_compile_add_self_flat[tensordict-compile] 0.1813ms 0.1422ms 7.0302 KOps/s 6.4658 KOps/s $\textbf{\color{#35bf28}+8.73\%}$
test_compile_add_self_flat[tensorclass-eager] 0.4806ms 71.9116μs 13.9060 KOps/s 13.5966 KOps/s $\color{#35bf28}+2.28\%$
test_compile_add_self_flat[tensorclass-compile] 0.1527ms 97.1135μs 10.2972 KOps/s 9.5990 KOps/s $\textbf{\color{#35bf28}+7.27\%}$
test_compile_add_self_flat[pytree-eager] 0.5313ms 0.4280ms 2.3362 KOps/s 2.3264 KOps/s $\color{#35bf28}+0.42\%$
test_compile_add_self_flat[pytree-compile] 0.1961ms 0.1350ms 7.4056 KOps/s 7.0233 KOps/s $\textbf{\color{#35bf28}+5.44\%}$
test_compile_copy_flat[tensordict-compile] 58.7700μs 19.3288μs 51.7362 KOps/s 53.0137 KOps/s $\color{#d91a1a}-2.41\%$
test_compile_copy_flat[tensordict-eager] 0.1805ms 31.9835μs 31.2661 KOps/s 30.3573 KOps/s $\color{#35bf28}+2.99\%$
test_compile_copy_flat[pytree-compile] 0.1012ms 69.3729μs 14.4148 KOps/s 14.4584 KOps/s $\color{#d91a1a}-0.30\%$
test_compile_copy_flat[pytree-eager] 85.5110μs 53.0354μs 18.8553 KOps/s 19.2538 KOps/s $\color{#d91a1a}-2.07\%$
test_compile_assign_and_add[tensordict-compile] 1.6339ms 0.3930ms 2.5444 KOps/s 2.1910 KOps/s $\textbf{\color{#35bf28}+16.13\%}$
test_compile_assign_and_add[tensordict-eager] 2.9832ms 2.8525ms 350.5658 Ops/s 351.7466 Ops/s $\color{#d91a1a}-0.34\%$
test_compile_assign_and_add[pytree-compile] 1.5868ms 0.4316ms 2.3170 KOps/s 2.2641 KOps/s $\color{#35bf28}+2.34\%$
test_compile_assign_and_add[pytree-eager] 2.8265ms 2.7073ms 369.3717 Ops/s 367.1360 Ops/s $\color{#35bf28}+0.61\%$
test_compile_indexing[tensor-tensordict-compile] 0.4068ms 0.1186ms 8.4319 KOps/s 8.8509 KOps/s $\color{#d91a1a}-4.73\%$
test_compile_indexing[tensor-tensordict-eager] 0.5479ms 83.8790μs 11.9219 KOps/s 11.3513 KOps/s $\textbf{\color{#35bf28}+5.03\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.2281ms 0.1082ms 9.2452 KOps/s 9.1376 KOps/s $\color{#35bf28}+1.18\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1404ms 69.7981μs 14.3270 KOps/s 14.0086 KOps/s $\color{#35bf28}+2.27\%$
test_compile_indexing[tensor-pytree-compile] 0.1700ms 0.1097ms 9.1175 KOps/s 8.9587 KOps/s $\color{#35bf28}+1.77\%$
test_compile_indexing[tensor-pytree-eager] 0.1274ms 70.1851μs 14.2480 KOps/s 13.8464 KOps/s $\color{#35bf28}+2.90\%$
test_compile_indexing[slice-tensordict-compile] 0.1515ms 0.1020ms 9.8028 KOps/s 9.7770 KOps/s $\color{#35bf28}+0.26\%$
test_compile_indexing[slice-tensordict-eager] 0.1534ms 20.3072μs 49.2436 KOps/s 51.9006 KOps/s $\textbf{\color{#d91a1a}-5.12\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1419ms 94.9196μs 10.5352 KOps/s 10.1591 KOps/s $\color{#35bf28}+3.70\%$
test_compile_indexing[slice-tensorclass-eager] 61.3910μs 15.8286μs 63.1769 KOps/s 61.9002 KOps/s $\color{#35bf28}+2.06\%$
test_compile_indexing[slice-pytree-compile] 0.1432ms 95.2595μs 10.4976 KOps/s 10.2053 KOps/s $\color{#35bf28}+2.86\%$
test_compile_indexing[slice-pytree-eager] 58.3800μs 16.0240μs 62.4063 KOps/s 61.4823 KOps/s $\color{#35bf28}+1.50\%$
test_compile_indexing[int-tensordict-compile] 0.1419ms 98.8902μs 10.1122 KOps/s 10.0393 KOps/s $\color{#35bf28}+0.73\%$
test_compile_indexing[int-tensordict-eager] 0.5989ms 18.5018μs 54.0488 KOps/s 51.4978 KOps/s $\color{#35bf28}+4.95\%$
test_compile_indexing[int-tensorclass-compile] 0.1527ms 0.1004ms 9.9575 KOps/s 10.3298 KOps/s $\color{#d91a1a}-3.60\%$
test_compile_indexing[int-tensorclass-eager] 0.1099ms 16.8312μs 59.4135 KOps/s 59.9771 KOps/s $\color{#d91a1a}-0.94\%$
test_compile_indexing[int-pytree-compile] 0.1547ms 0.1007ms 9.9256 KOps/s 9.8528 KOps/s $\color{#35bf28}+0.74\%$
test_compile_indexing[int-pytree-eager] 42.9900μs 15.9444μs 62.7178 KOps/s 58.0583 KOps/s $\textbf{\color{#35bf28}+8.03\%}$
test_mod_add[eager] 79.2010μs 37.5222μs 26.6509 KOps/s 24.5738 KOps/s $\textbf{\color{#35bf28}+8.45\%}$
test_mod_add[compile] 0.1445ms 81.3383μs 12.2943 KOps/s 11.6159 KOps/s $\textbf{\color{#35bf28}+5.84\%}$
test_mod_add[compile-overhead] 0.3259ms 0.1691ms 5.9140 KOps/s 5.7226 KOps/s $\color{#35bf28}+3.34\%$
test_mod_wrap[eager] 0.3382ms 0.2597ms 3.8509 KOps/s 3.7297 KOps/s $\color{#35bf28}+3.25\%$
test_mod_wrap[compile] 0.4070ms 0.2982ms 3.3531 KOps/s 3.3184 KOps/s $\color{#35bf28}+1.05\%$
test_mod_wrap[compile-overhead] 7.3350ms 3.8152ms 262.1118 Ops/s 266.9108 Ops/s $\color{#d91a1a}-1.80\%$
test_mod_wrap_and_backward[eager] 1.6674ms 1.5058ms 664.1011 Ops/s 676.7922 Ops/s $\color{#d91a1a}-1.88\%$
test_mod_wrap_and_backward[compile] 1.4310ms 1.2759ms 783.7718 Ops/s 715.4035 Ops/s $\textbf{\color{#35bf28}+9.56\%}$
test_mod_wrap_and_backward[compile-overhead] 1.4560ms 0.9388ms 1.0652 KOps/s 952.4134 Ops/s $\textbf{\color{#35bf28}+11.84\%}$
test_seq_add[eager] 0.3425ms 0.1330ms 7.5189 KOps/s 7.7647 KOps/s $\color{#d91a1a}-3.17\%$
test_seq_add[compile] 0.2501ms 90.8193μs 11.0109 KOps/s 10.9789 KOps/s $\color{#35bf28}+0.29\%$
test_seq_add[compile-overhead] 0.1761ms 0.1294ms 7.7287 KOps/s 7.6715 KOps/s $\color{#35bf28}+0.75\%$
test_seq_wrap[eager] 1.0684ms 0.4402ms 2.2719 KOps/s 2.2619 KOps/s $\color{#35bf28}+0.44\%$
test_seq_wrap[compile] 1.1770ms 0.3201ms 3.1242 KOps/s 3.2243 KOps/s $\color{#d91a1a}-3.11\%$
test_seq_wrap[compile-overhead] 0.2829ms 0.2309ms 4.3310 KOps/s 4.4016 KOps/s $\color{#d91a1a}-1.60\%$
test_func_call_runtime[False-eager] 1.0061ms 0.7826ms 1.2779 KOps/s 1.3252 KOps/s $\color{#d91a1a}-3.57\%$
test_func_call_runtime[False-compile] 0.8116ms 0.7447ms 1.3428 KOps/s 1.3143 KOps/s $\color{#35bf28}+2.17\%$
test_func_call_runtime[False-compile-overhead] 0.4223ms 0.3594ms 2.7827 KOps/s 2.7597 KOps/s $\color{#35bf28}+0.83\%$
test_func_call_runtime[True-eager] 0.9723ms 0.9046ms 1.1055 KOps/s 1.0836 KOps/s $\color{#35bf28}+2.02\%$
test_func_call_runtime[True-compile] 0.8898ms 0.7676ms 1.3027 KOps/s 1.2841 KOps/s $\color{#35bf28}+1.45\%$
test_func_call_runtime[True-compile-overhead] 0.4506ms 0.3803ms 2.6293 KOps/s 2.5857 KOps/s $\color{#35bf28}+1.69\%$
test_func_call_cm_runtime[False-eager] 0.9200ms 0.7394ms 1.3524 KOps/s 1.2660 KOps/s $\textbf{\color{#35bf28}+6.82\%}$
test_func_call_cm_runtime[False-compile] 0.8251ms 0.7471ms 1.3385 KOps/s 1.3075 KOps/s $\color{#35bf28}+2.38\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4971ms 0.3625ms 2.7587 KOps/s 2.7363 KOps/s $\color{#35bf28}+0.82\%$
test_func_call_cm_runtime[True-eager] 1.1044ms 1.0160ms 984.2367 Ops/s 970.0317 Ops/s $\color{#35bf28}+1.46\%$
test_func_call_cm_runtime[True-compile] 1.2508ms 1.0484ms 953.8681 Ops/s 987.6730 Ops/s $\color{#d91a1a}-3.42\%$
test_func_call_cm_runtime[True-compile-overhead] 1.0651ms 0.9966ms 1.0035 KOps/s 986.0215 Ops/s $\color{#35bf28}+1.77\%$
test_vmap_func_call_cm_runtime[eager] 2.5575ms 2.1191ms 471.8919 Ops/s 468.9038 Ops/s $\color{#35bf28}+0.64\%$
test_vmap_func_call_cm_runtime[compile] 0.9611ms 0.8144ms 1.2280 KOps/s 1.2070 KOps/s $\color{#35bf28}+1.74\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5324ms 0.4121ms 2.4268 KOps/s 2.3967 KOps/s $\color{#35bf28}+1.26\%$
test_distributed 2.8178ms 0.2361ms 4.2363 KOps/s 6.9872 KOps/s $\textbf{\color{#d91a1a}-39.37\%}$
test_tdmodule 52.1310μs 20.3333μs 49.1803 KOps/s 48.0184 KOps/s $\color{#35bf28}+2.42\%$
test_tdmodule_dispatch 58.8610μs 38.5099μs 25.9673 KOps/s 26.3058 KOps/s $\color{#d91a1a}-1.29\%$
test_tdseq 49.0510μs 20.8818μs 47.8886 KOps/s 49.5958 KOps/s $\color{#d91a1a}-3.44\%$
test_tdseq_dispatch 71.0710μs 40.6758μs 24.5846 KOps/s 24.6576 KOps/s $\color{#d91a1a}-0.30\%$
test_instantiation_functorch 1.6743ms 1.5654ms 638.8239 Ops/s 643.6904 Ops/s $\color{#d91a1a}-0.76\%$
test_exec_functorch 0.1952ms 0.1445ms 6.9192 KOps/s 6.7673 KOps/s $\color{#35bf28}+2.24\%$
test_exec_functional_call 0.1958ms 0.1393ms 7.1813 KOps/s 7.1099 KOps/s $\color{#35bf28}+1.01\%$
test_exec_td_decorator 0.3935ms 0.1913ms 5.2265 KOps/s 5.2375 KOps/s $\color{#d91a1a}-0.21\%$
test_vmap_mlp_speed_decorator[True-True] 0.9261ms 0.6980ms 1.4326 KOps/s 1.4345 KOps/s $\color{#d91a1a}-0.14\%$
test_vmap_mlp_speed_decorator[True-False] 0.9696ms 0.7052ms 1.4179 KOps/s 1.4357 KOps/s $\color{#d91a1a}-1.24\%$
test_vmap_mlp_speed_decorator[False-True] 0.7698ms 0.6361ms 1.5722 KOps/s 1.6572 KOps/s $\textbf{\color{#d91a1a}-5.13\%}$
test_vmap_mlp_speed_decorator[False-False] 0.7621ms 0.6359ms 1.5725 KOps/s 1.6496 KOps/s $\color{#d91a1a}-4.67\%$
test_vmap_transformer_speed_decorator[True-True] 21.0091ms 19.6024ms 51.0140 Ops/s 51.4821 Ops/s $\color{#d91a1a}-0.91\%$
test_vmap_transformer_speed_decorator[True-False] 20.0412ms 19.4487ms 51.4173 Ops/s 51.7350 Ops/s $\color{#d91a1a}-0.61\%$
test_vmap_transformer_speed_decorator[False-True] 20.2710ms 19.3762ms 51.6098 Ops/s 52.0226 Ops/s $\color{#d91a1a}-0.79\%$
test_vmap_transformer_speed_decorator[False-False] 19.9192ms 19.3236ms 51.7501 Ops/s 51.9541 Ops/s $\color{#d91a1a}-0.39\%$
test_to_module_speed[True] 1.2624ms 0.9606ms 1.0411 KOps/s 1.0317 KOps/s $\color{#35bf28}+0.91\%$
test_to_module_speed[False] 1.4120ms 0.9397ms 1.0642 KOps/s 1.0273 KOps/s $\color{#35bf28}+3.59\%$
test_tc_init 0.1427ms 35.4700μs 28.1928 KOps/s 28.6021 KOps/s $\color{#d91a1a}-1.43\%$
test_tc_init_tensor_only 0.1021ms 10.7388μs 93.1200 KOps/s 92.0951 KOps/s $\color{#35bf28}+1.11\%$
test_tc_init_nested 0.1980ms 69.0362μs 14.4852 KOps/s 14.4156 KOps/s $\color{#35bf28}+0.48\%$
test_tc_first_layer_tensor 26.8200μs 0.8853μs 1.1296 MOps/s 1.2323 MOps/s $\textbf{\color{#d91a1a}-8.34\%}$
test_tc_first_layer_tensor_only 2.6341μs 0.4263μs 2.3458 MOps/s 2.3534 MOps/s $\color{#d91a1a}-0.32\%$
test_tc_first_layer_tensor_set 40.7100μs 2.9434μs 339.7458 KOps/s 340.2979 KOps/s $\color{#d91a1a}-0.16\%$
test_tc_first_layer_tensor_only_set 11.9600μs 1.7551μs 569.7531 KOps/s 566.2872 KOps/s $\color{#35bf28}+0.61\%$
test_tc_first_layer_nontensor 38.7600μs 2.3270μs 429.7406 KOps/s 428.7512 KOps/s $\color{#35bf28}+0.23\%$
test_tc_second_layer_tensor 26.0200μs 1.7390μs 575.0268 KOps/s 573.4767 KOps/s $\color{#35bf28}+0.27\%$
test_tc_second_layer_nontensor 36.5100μs 3.1622μs 316.2384 KOps/s 313.8679 KOps/s $\color{#35bf28}+0.76\%$
test_unbind 0.2331s 10.3875ms 96.2698 Ops/s 142.9149 Ops/s $\textbf{\color{#d91a1a}-32.64\%}$
test_full_like 5.9242ms 4.3785ms 228.3870 Ops/s 112.4693 Ops/s $\textbf{\color{#35bf28}+103.07\%}$
test_zeros_like 5.2335ms 4.3382ms 230.5126 Ops/s 231.0643 Ops/s $\color{#d91a1a}-0.24\%$
test_ones_like 9.2220ms 7.0586ms 141.6709 Ops/s 229.4670 Ops/s $\textbf{\color{#d91a1a}-38.26\%}$
test_clone 11.8163ms 9.2990ms 107.5389 Ops/s 154.9824 Ops/s $\textbf{\color{#d91a1a}-30.61\%}$
test_squeeze 78.6710μs 9.9286μs 100.7191 KOps/s 92.9277 KOps/s $\textbf{\color{#35bf28}+8.38\%}$
test_unsqueeze 0.1587ms 73.3211μs 13.6386 KOps/s 13.1308 KOps/s $\color{#35bf28}+3.87\%$
test_split 0.4118ms 0.1607ms 6.2230 KOps/s 5.7777 KOps/s $\textbf{\color{#35bf28}+7.71\%}$
test_permute 0.2502ms 0.1793ms 5.5779 KOps/s 5.4889 KOps/s $\color{#35bf28}+1.62\%$
test_stack 54.2812ms 51.4019ms 19.4545 Ops/s 54.5229 Ops/s $\textbf{\color{#d91a1a}-64.32\%}$
test_cat 51.5597ms 51.0063ms 19.6054 Ops/s 54.6659 Ops/s $\textbf{\color{#d91a1a}-64.14\%}$

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Apr 23, 2025
ghstack-source-id: deac3f0
Pull Request resolved: #1293
@vmoens vmoens mentioned this pull request Apr 23, 2025
@vmoens vmoens merged commit 8ee20fb into gh/vmoens/53/base Apr 23, 2025
45 of 48 checks passed
vmoens pushed a commit that referenced this pull request Apr 23, 2025
ghstack-source-id: deac3f0
Pull Request resolved: #1293
@vmoens vmoens deleted the gh/vmoens/53/head branch April 23, 2025 11:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants