Skip to content

[Test] Deactivate failing test on 2.6#1289

Merged
vmoens merged 1 commit intogh/vmoens/52/basefrom
gh/vmoens/52/head
Apr 17, 2025
Merged

[Test] Deactivate failing test on 2.6#1289
vmoens merged 1 commit intogh/vmoens/52/basefrom
gh/vmoens/52/head

Conversation

@vmoens
Copy link
Copy Markdown
Collaborator

@vmoens vmoens commented Apr 17, 2025

[ghstack-poisoned]
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 17, 2025
@vmoens vmoens merged commit b3de701 into gh/vmoens/52/base Apr 17, 2025
31 of 34 checks passed
vmoens pushed a commit that referenced this pull request Apr 17, 2025
ghstack-source-id: 7a9ad08
Pull Request resolved: #1289
@vmoens vmoens deleted the gh/vmoens/52/head branch April 17, 2025 09:51
@github-actions
Copy link
Copy Markdown
Contributor

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 233. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}21$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 32.0600μs 11.3559μs 88.0601 KOps/s 88.3073 KOps/s $\color{#d91a1a}-0.28\%$
test_plain_set_stack_nested 33.3110μs 11.3705μs 87.9471 KOps/s 86.9111 KOps/s $\color{#35bf28}+1.19\%$
test_plain_set_nested_inplace 39.0500μs 12.3519μs 80.9591 KOps/s 80.4367 KOps/s $\color{#35bf28}+0.65\%$
test_plain_set_stack_nested_inplace 50.0610μs 12.4262μs 80.4752 KOps/s 80.5967 KOps/s $\color{#d91a1a}-0.15\%$
test_items 0.1637ms 2.9377μs 340.4031 KOps/s 343.6360 KOps/s $\color{#d91a1a}-0.94\%$
test_items_nested 0.5525ms 0.3627ms 2.7571 KOps/s 2.7415 KOps/s $\color{#35bf28}+0.57\%$
test_items_nested_locked 0.4036ms 0.3638ms 2.7486 KOps/s 2.7831 KOps/s $\color{#d91a1a}-1.24\%$
test_items_nested_leaf 0.1265ms 60.0910μs 16.6414 KOps/s 16.6752 KOps/s $\color{#d91a1a}-0.20\%$
test_items_stack_nested 0.4682ms 0.3644ms 2.7446 KOps/s 2.7366 KOps/s $\color{#35bf28}+0.29\%$
test_items_stack_nested_leaf 94.0910μs 60.3300μs 16.5755 KOps/s 16.6604 KOps/s $\color{#d91a1a}-0.51\%$
test_items_stack_nested_locked 0.4422ms 0.3698ms 2.7044 KOps/s 2.7222 KOps/s $\color{#d91a1a}-0.65\%$
test_keys 62.4110μs 3.4410μs 290.6128 KOps/s 290.2424 KOps/s $\color{#35bf28}+0.13\%$
test_keys_nested 0.1416ms 87.9650μs 11.3682 KOps/s 11.3509 KOps/s $\color{#35bf28}+0.15\%$
test_keys_nested_locked 0.8090ms 94.3331μs 10.6007 KOps/s 10.4814 KOps/s $\color{#35bf28}+1.14\%$
test_keys_nested_leaf 0.1045ms 79.1564μs 12.6332 KOps/s 12.5795 KOps/s $\color{#35bf28}+0.43\%$
test_keys_stack_nested 0.1192ms 88.2757μs 11.3281 KOps/s 11.3570 KOps/s $\color{#d91a1a}-0.25\%$
test_keys_stack_nested_leaf 0.1151ms 78.8856μs 12.6766 KOps/s 12.6743 KOps/s $\color{#35bf28}+0.02\%$
test_keys_stack_nested_locked 0.1188ms 93.9351μs 10.6456 KOps/s 10.6722 KOps/s $\color{#d91a1a}-0.25\%$
test_values 6.2083μs 0.8532μs 1.1721 MOps/s 1.1771 MOps/s $\color{#d91a1a}-0.42\%$
test_values_nested 75.7010μs 37.4195μs 26.7240 KOps/s 26.6247 KOps/s $\color{#35bf28}+0.37\%$
test_values_nested_locked 75.5310μs 39.4039μs 25.3782 KOps/s 25.2708 KOps/s $\color{#35bf28}+0.43\%$
test_values_nested_leaf 99.3610μs 42.2170μs 23.6871 KOps/s 23.5167 KOps/s $\color{#35bf28}+0.72\%$
test_values_stack_nested 65.3900μs 37.5524μs 26.6294 KOps/s 26.4705 KOps/s $\color{#35bf28}+0.60\%$
test_values_stack_nested_leaf 73.2000μs 42.4647μs 23.5490 KOps/s 23.4003 KOps/s $\color{#35bf28}+0.64\%$
test_values_stack_nested_locked 66.6010μs 39.5514μs 25.2836 KOps/s 25.2866 KOps/s $\color{#d91a1a}-0.01\%$
test_membership 3.7090μs 0.5037μs 1.9852 MOps/s 1.9912 MOps/s $\color{#d91a1a}-0.30\%$
test_membership_nested 14.1200μs 1.9912μs 502.1981 KOps/s 500.1606 KOps/s $\color{#35bf28}+0.41\%$
test_membership_nested_leaf 16.7200μs 1.9887μs 502.8371 KOps/s 494.6037 KOps/s $\color{#35bf28}+1.66\%$
test_membership_stacked_nested 30.9000μs 2.0591μs 485.6436 KOps/s 475.7752 KOps/s $\color{#35bf28}+2.07\%$
test_membership_stacked_nested_leaf 27.0600μs 2.0758μs 481.7452 KOps/s 483.8719 KOps/s $\color{#d91a1a}-0.44\%$
test_membership_nested_last 76.9710μs 3.0531μs 327.5332 KOps/s 329.4751 KOps/s $\color{#d91a1a}-0.59\%$
test_membership_nested_leaf_last 31.9700μs 3.0347μs 329.5232 KOps/s 326.7085 KOps/s $\color{#35bf28}+0.86\%$
test_membership_stacked_nested_last 25.4400μs 3.0240μs 330.6895 KOps/s 326.4937 KOps/s $\color{#35bf28}+1.29\%$
test_membership_stacked_nested_leaf_last 68.1410μs 2.9930μs 334.1172 KOps/s 325.0401 KOps/s $\color{#35bf28}+2.79\%$
test_nested_getleaf 0.1486ms 13.0838μs 76.4304 KOps/s 77.0258 KOps/s $\color{#d91a1a}-0.77\%$
test_nested_get 40.5800μs 12.3640μs 80.8797 KOps/s 81.0940 KOps/s $\color{#d91a1a}-0.26\%$
test_stacked_getleaf 49.5800μs 13.0415μs 76.6784 KOps/s 77.0848 KOps/s $\color{#d91a1a}-0.53\%$
test_stacked_get 39.7000μs 12.3523μs 80.9563 KOps/s 81.8838 KOps/s $\color{#d91a1a}-1.13\%$
test_nested_getitemleaf 46.2510μs 13.4263μs 74.4805 KOps/s 74.7434 KOps/s $\color{#d91a1a}-0.35\%$
test_nested_getitem 46.8500μs 12.6619μs 78.9773 KOps/s 78.6426 KOps/s $\color{#35bf28}+0.43\%$
test_stacked_getitemleaf 55.0300μs 13.3718μs 74.7842 KOps/s 74.4632 KOps/s $\color{#35bf28}+0.43\%$
test_stacked_getitem 40.7000μs 12.6507μs 79.0471 KOps/s 79.1624 KOps/s $\color{#d91a1a}-0.15\%$
test_lock_nested 0.7388ms 0.3638ms 2.7485 KOps/s 2.8686 KOps/s $\color{#d91a1a}-4.19\%$
test_lock_stack_nested 0.4831ms 0.3501ms 2.8560 KOps/s 2.9120 KOps/s $\color{#d91a1a}-1.92\%$
test_unlock_nested 0.5358ms 0.3067ms 3.2606 KOps/s 3.4615 KOps/s $\textbf{\color{#d91a1a}-5.80\%}$
test_unlock_stack_nested 0.4146ms 0.2916ms 3.4294 KOps/s 3.5642 KOps/s $\color{#d91a1a}-3.78\%$
test_flatten_speed 0.1447ms 76.4353μs 13.0830 KOps/s 12.9471 KOps/s $\color{#35bf28}+1.05\%$
test_unflatten_speed 0.4298ms 0.3903ms 2.5619 KOps/s 2.5026 KOps/s $\color{#35bf28}+2.37\%$
test_common_ops 0.9319ms 0.6446ms 1.5513 KOps/s 1.5890 KOps/s $\color{#d91a1a}-2.37\%$
test_creation 79.3610μs 1.7642μs 566.8365 KOps/s 566.9262 KOps/s $\color{#d91a1a}-0.02\%$
test_creation_empty 0.6185ms 7.2411μs 138.1014 KOps/s 139.7413 KOps/s $\color{#d91a1a}-1.17\%$
test_creation_nested_1 97.4210μs 10.1412μs 98.6077 KOps/s 99.7458 KOps/s $\color{#d91a1a}-1.14\%$
test_creation_nested_2 0.1059ms 12.9915μs 76.9734 KOps/s 77.8741 KOps/s $\color{#d91a1a}-1.16\%$
test_clone 88.3610μs 11.1092μs 90.0154 KOps/s 94.9171 KOps/s $\textbf{\color{#d91a1a}-5.16\%}$
test_getitem[int] 0.1497ms 10.6608μs 93.8018 KOps/s 95.6507 KOps/s $\color{#d91a1a}-1.93\%$
test_getitem[slice_int] 0.1412ms 21.5050μs 46.5009 KOps/s 50.0261 KOps/s $\textbf{\color{#d91a1a}-7.05\%}$
test_getitem[range] 0.1334ms 38.7319μs 25.8185 KOps/s 26.6415 KOps/s $\color{#d91a1a}-3.09\%$
test_getitem[tuple] 0.1112ms 18.1680μs 55.0418 KOps/s 55.8929 KOps/s $\color{#d91a1a}-1.52\%$
test_getitem[list] 0.1799ms 34.2452μs 29.2011 KOps/s 29.9212 KOps/s $\color{#d91a1a}-2.41\%$
test_setitem_dim[int] 42.4100μs 19.4409μs 51.4380 KOps/s 53.0024 KOps/s $\color{#d91a1a}-2.95\%$
test_setitem_dim[slice_int] 61.9310μs 38.4632μs 25.9989 KOps/s 26.3034 KOps/s $\color{#d91a1a}-1.16\%$
test_setitem_dim[range] 0.1656ms 53.3495μs 18.7443 KOps/s 19.0181 KOps/s $\color{#d91a1a}-1.44\%$
test_setitem_dim[tuple] 54.8610μs 32.5966μs 30.6780 KOps/s 30.8428 KOps/s $\color{#d91a1a}-0.53\%$
test_setitem 0.2266ms 15.9453μs 62.7144 KOps/s 65.8814 KOps/s $\color{#d91a1a}-4.81\%$
test_set 0.2378ms 15.4352μs 64.7870 KOps/s 68.9047 KOps/s $\textbf{\color{#d91a1a}-5.98\%}$
test_set_shared 0.6256ms 0.1607ms 6.2240 KOps/s 6.2761 KOps/s $\color{#d91a1a}-0.83\%$
test_update 0.2541ms 18.8710μs 52.9913 KOps/s 57.0988 KOps/s $\textbf{\color{#d91a1a}-7.19\%}$
test_update_nested 0.1557ms 29.3759μs 34.0415 KOps/s 36.1445 KOps/s $\textbf{\color{#d91a1a}-5.82\%}$
test_update__nested 73.9210μs 25.2924μs 39.5376 KOps/s 40.2838 KOps/s $\color{#d91a1a}-1.85\%$
test_set_nested 0.1411ms 16.5594μs 60.3887 KOps/s 63.5223 KOps/s $\color{#d91a1a}-4.93\%$
test_set_nested_new 0.1099ms 19.6852μs 50.7995 KOps/s 53.8198 KOps/s $\textbf{\color{#d91a1a}-5.61\%}$
test_select 0.1471ms 31.7686μs 31.4776 KOps/s 33.1156 KOps/s $\color{#d91a1a}-4.95\%$
test_select_nested 71.2100μs 43.3470μs 23.0696 KOps/s 23.2467 KOps/s $\color{#d91a1a}-0.76\%$
test_exclude_nested 90.4210μs 62.4986μs 16.0003 KOps/s 16.0479 KOps/s $\color{#d91a1a}-0.30\%$
test_empty[True] 0.3367ms 0.2960ms 3.3778 KOps/s 3.3879 KOps/s $\color{#d91a1a}-0.30\%$
test_empty[False] 8.5281μs 0.8062μs 1.2404 MOps/s 1.2115 MOps/s $\color{#35bf28}+2.38\%$
test_to 88.3110μs 58.2730μs 17.1606 KOps/s 17.1941 KOps/s $\color{#d91a1a}-0.19\%$
test_to_nonblocking 0.2129ms 50.0650μs 19.9740 KOps/s 20.1849 KOps/s $\color{#d91a1a}-1.04\%$
test_unbind_speed 0.8466ms 0.2502ms 3.9966 KOps/s 4.2213 KOps/s $\textbf{\color{#d91a1a}-5.32\%}$
test_unbind_speed_stack0 0.3154ms 0.2483ms 4.0276 KOps/s 4.2367 KOps/s $\color{#d91a1a}-4.94\%$
test_unbind_speed_stack1 99.0773ms 0.7575ms 1.3201 KOps/s 1.3389 KOps/s $\color{#d91a1a}-1.41\%$
test_split 96.4057ms 1.6075ms 622.0704 Ops/s 629.4036 Ops/s $\color{#d91a1a}-1.17\%$
test_chunk 98.7502ms 1.6089ms 621.5333 Ops/s 631.2474 Ops/s $\color{#d91a1a}-1.54\%$
test_consolidate[False-None] 97.5724ms 3.1400ms 318.4743 Ops/s 319.6422 Ops/s $\color{#d91a1a}-0.37\%$
test_consolidate[default-None] 1.9243ms 1.7349ms 576.3994 Ops/s 582.9945 Ops/s $\color{#d91a1a}-1.13\%$
test_consolidate[reduce-overhead-None] 1.9174ms 1.7728ms 564.0675 Ops/s 575.7074 Ops/s $\color{#d91a1a}-2.02\%$
test_consolidate_njt[False-None] 6.9777ms 6.5055ms 153.7167 Ops/s 152.0910 Ops/s $\color{#35bf28}+1.07\%$
test_to[False-False-None] 1.9080ms 1.7777ms 562.5268 Ops/s 560.3686 Ops/s $\color{#35bf28}+0.39\%$
test_to[True-False-None] 0.3031s 1.8542ms 539.3105 Ops/s 721.9701 Ops/s $\textbf{\color{#d91a1a}-25.30\%}$
test_to[within-False-None] 4.5259ms 4.3143ms 231.7860 Ops/s 232.8257 Ops/s $\color{#d91a1a}-0.45\%$
test_to[True-default-None] 5.6818ms 5.2907ms 189.0097 Ops/s 195.6375 Ops/s $\color{#d91a1a}-3.39\%$
test_to_njt[False-False-None] 7.1472ms 6.9090ms 144.7386 Ops/s 144.4133 Ops/s $\color{#35bf28}+0.23\%$
test_to_njt[True-False-None] 5.6919ms 5.4297ms 184.1726 Ops/s 183.5775 Ops/s $\color{#35bf28}+0.32\%$
test_to_njt[within-False-None] 12.4217ms 12.0162ms 83.2212 Ops/s 82.4118 Ops/s $\color{#35bf28}+0.98\%$
test_creation[device0] 0.6563ms 79.9084μs 12.5143 KOps/s 12.0148 KOps/s $\color{#35bf28}+4.16\%$
test_creation_from_tensor 0.7340ms 83.1870μs 12.0211 KOps/s 11.6583 KOps/s $\color{#35bf28}+3.11\%$
test_add_one[memmap_tensor0] 0.2741ms 7.2045μs 138.8028 KOps/s 146.4765 KOps/s $\textbf{\color{#d91a1a}-5.24\%}$
test_contiguous[memmap_tensor0] 1.9216μs 0.4190μs 2.3867 MOps/s 2.3714 MOps/s $\color{#35bf28}+0.65\%$
test_stack[memmap_tensor0] 53.0700μs 4.7462μs 210.6935 KOps/s 226.2956 KOps/s $\textbf{\color{#d91a1a}-6.89\%}$
test_memmaptd_index 1.6934ms 0.2482ms 4.0295 KOps/s 4.1836 KOps/s $\color{#d91a1a}-3.68\%$
test_memmaptd_index_astensor 0.4379ms 0.3096ms 3.2296 KOps/s 3.2829 KOps/s $\color{#d91a1a}-1.62\%$
test_memmaptd_index_op 0.9783ms 0.5779ms 1.7304 KOps/s 1.8173 KOps/s $\color{#d91a1a}-4.78\%$
test_serialize_model 0.1339s 0.1323s 7.5576 Ops/s 7.5530 Ops/s $\color{#35bf28}+0.06\%$
test_serialize_model_pickle 1.3498s 1.2117s 0.8253 Ops/s 0.8239 Ops/s $\color{#35bf28}+0.16\%$
test_serialize_weights 0.1329s 0.1315s 7.6030 Ops/s 5.4186 Ops/s $\textbf{\color{#35bf28}+40.31\%}$
test_serialize_weights_returnearly 0.3708s 63.5721ms 15.7302 Ops/s 23.4379 Ops/s $\textbf{\color{#d91a1a}-32.89\%}$
test_serialize_weights_pickle 1.3770s 1.2167s 0.8219 Ops/s 0.8438 Ops/s $\color{#d91a1a}-2.59\%$
test_reshape_pytree 60.1210μs 22.1973μs 45.0505 KOps/s 44.5412 KOps/s $\color{#35bf28}+1.14\%$
test_reshape_td 63.6910μs 27.1520μs 36.8297 KOps/s 36.3786 KOps/s $\color{#35bf28}+1.24\%$
test_view_pytree 0.1322ms 22.0003μs 45.4540 KOps/s 45.2364 KOps/s $\color{#35bf28}+0.48\%$
test_view_td 63.8010μs 30.9830μs 32.2758 KOps/s 30.4593 KOps/s $\textbf{\color{#35bf28}+5.96\%}$
test_unbind_pytree 0.1870ms 28.5160μs 35.0680 KOps/s 35.7871 KOps/s $\color{#d91a1a}-2.01\%$
test_unbind_td 0.7045ms 37.8008μs 26.4545 KOps/s 26.9552 KOps/s $\color{#d91a1a}-1.86\%$
test_split_pytree 0.1329ms 29.7082μs 33.6607 KOps/s 33.1790 KOps/s $\color{#35bf28}+1.45\%$
test_split_td 0.7618ms 38.8757μs 25.7230 KOps/s 25.5287 KOps/s $\color{#35bf28}+0.76\%$
test_add_pytree 0.1716ms 35.9791μs 27.7939 KOps/s 29.0814 KOps/s $\color{#d91a1a}-4.43\%$
test_add_td 0.2868ms 49.6598μs 20.1370 KOps/s 21.0608 KOps/s $\color{#d91a1a}-4.39\%$
test_compile_add_one_nested[tensordict-compile] 0.2750ms 0.1248ms 8.0126 KOps/s 7.7918 KOps/s $\color{#35bf28}+2.83\%$
test_compile_add_one_nested[tensordict-eager] 0.3317ms 0.1440ms 6.9461 KOps/s 6.9292 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_one_nested[pytree-compile] 0.2426ms 98.2477μs 10.1784 KOps/s 9.9547 KOps/s $\color{#35bf28}+2.25\%$
test_compile_add_one_nested[pytree-eager] 1.1265ms 0.1512ms 6.6147 KOps/s 6.5130 KOps/s $\color{#35bf28}+1.56\%$
test_compile_copy_nested[tensordict-compile] 0.1662ms 24.1554μs 41.3986 KOps/s 43.9635 KOps/s $\textbf{\color{#d91a1a}-5.83\%}$
test_compile_copy_nested[tensordict-eager] 0.1779ms 35.3852μs 28.2604 KOps/s 28.3941 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_copy_nested[pytree-compile] 0.1836ms 64.2501μs 15.5642 KOps/s 15.5994 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_copy_nested[pytree-eager] 0.1031ms 48.8623μs 20.4657 KOps/s 20.4615 KOps/s $\color{#35bf28}+0.02\%$
test_compile_add_one_flat[tensordict-compile] 0.1938ms 0.1433ms 6.9773 KOps/s 6.8554 KOps/s $\color{#35bf28}+1.78\%$
test_compile_add_one_flat[tensordict-eager] 0.3689ms 0.2211ms 4.5238 KOps/s 4.5520 KOps/s $\color{#d91a1a}-0.62\%$
test_compile_add_one_flat[tensorclass-compile] 0.2472ms 98.9211μs 10.1091 KOps/s 9.8503 KOps/s $\color{#35bf28}+2.63\%$
test_compile_add_one_flat[tensorclass-eager] 0.2098ms 58.9994μs 16.9493 KOps/s 16.7315 KOps/s $\color{#35bf28}+1.30\%$
test_compile_add_one_flat[pytree-compile] 0.2690ms 0.1394ms 7.1741 KOps/s 7.1277 KOps/s $\color{#35bf28}+0.65\%$
test_compile_add_one_flat[pytree-eager] 0.6801ms 0.4949ms 2.0206 KOps/s 2.0050 KOps/s $\color{#35bf28}+0.78\%$
test_compile_add_self_flat[tensordict-eager] 0.4353ms 0.2676ms 3.7367 KOps/s 3.7550 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_add_self_flat[tensordict-compile] 0.2288ms 0.1454ms 6.8776 KOps/s 6.9074 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_add_self_flat[tensorclass-eager] 0.2233ms 71.0880μs 14.0671 KOps/s 14.1775 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_add_self_flat[tensorclass-compile] 0.1350ms 98.9794μs 10.1031 KOps/s 10.0752 KOps/s $\color{#35bf28}+0.28\%$
test_compile_add_self_flat[pytree-eager] 0.5701ms 0.4122ms 2.4260 KOps/s 2.3617 KOps/s $\color{#35bf28}+2.72\%$
test_compile_add_self_flat[pytree-compile] 0.2792ms 0.1358ms 7.3614 KOps/s 7.3560 KOps/s $\color{#35bf28}+0.07\%$
test_compile_copy_flat[tensordict-compile] 0.1567ms 19.2414μs 51.9713 KOps/s 55.7109 KOps/s $\textbf{\color{#d91a1a}-6.71\%}$
test_compile_copy_flat[tensordict-eager] 58.3210μs 32.5083μs 30.7614 KOps/s 31.4495 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_copy_flat[pytree-compile] 0.1055ms 69.5157μs 14.3852 KOps/s 14.3122 KOps/s $\color{#35bf28}+0.51\%$
test_compile_copy_flat[pytree-eager] 0.1955ms 52.0652μs 19.2067 KOps/s 19.2682 KOps/s $\color{#d91a1a}-0.32\%$
test_compile_assign_and_add[tensordict-compile] 1.6397ms 0.3957ms 2.5269 KOps/s 2.1618 KOps/s $\textbf{\color{#35bf28}+16.89\%}$
test_compile_assign_and_add[tensordict-eager] 3.3778ms 2.8236ms 354.1636 Ops/s 362.5975 Ops/s $\color{#d91a1a}-2.33\%$
test_compile_assign_and_add[pytree-compile] 1.6281ms 0.4447ms 2.2487 KOps/s 2.2317 KOps/s $\color{#35bf28}+0.76\%$
test_compile_assign_and_add[pytree-eager] 2.8958ms 2.6896ms 371.8052 Ops/s 369.7721 Ops/s $\color{#35bf28}+0.55\%$
test_compile_indexing[tensor-tensordict-compile] 0.2827ms 0.1111ms 8.9988 KOps/s 8.6373 KOps/s $\color{#35bf28}+4.18\%$
test_compile_indexing[tensor-tensordict-eager] 0.5480ms 82.3983μs 12.1362 KOps/s 11.1335 KOps/s $\textbf{\color{#35bf28}+9.01\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.2505ms 0.1072ms 9.3308 KOps/s 9.1980 KOps/s $\color{#35bf28}+1.44\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2347ms 68.1889μs 14.6652 KOps/s 13.5760 KOps/s $\textbf{\color{#35bf28}+8.02\%}$
test_compile_indexing[tensor-pytree-compile] 0.2821ms 0.1085ms 9.2177 KOps/s 8.7738 KOps/s $\textbf{\color{#35bf28}+5.06\%}$
test_compile_indexing[tensor-pytree-eager] 0.2320ms 67.7085μs 14.7692 KOps/s 13.7887 KOps/s $\textbf{\color{#35bf28}+7.11\%}$
test_compile_indexing[slice-tensordict-compile] 0.2507ms 0.1015ms 9.8474 KOps/s 9.6422 KOps/s $\color{#35bf28}+2.13\%$
test_compile_indexing[slice-tensordict-eager] 0.1423ms 18.8625μs 53.0151 KOps/s 51.3082 KOps/s $\color{#35bf28}+3.33\%$
test_compile_indexing[slice-tensorclass-compile] 0.2432ms 97.7096μs 10.2344 KOps/s 10.2959 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_indexing[slice-tensorclass-eager] 0.1371ms 15.8665μs 63.0260 KOps/s 63.8423 KOps/s $\color{#d91a1a}-1.28\%$
test_compile_indexing[slice-pytree-compile] 0.2449ms 98.2404μs 10.1791 KOps/s 9.8426 KOps/s $\color{#35bf28}+3.42\%$
test_compile_indexing[slice-pytree-eager] 65.5810μs 15.7423μs 63.5232 KOps/s 64.7064 KOps/s $\color{#d91a1a}-1.83\%$
test_compile_indexing[int-tensordict-compile] 0.2651ms 0.1025ms 9.7537 KOps/s 9.8568 KOps/s $\color{#d91a1a}-1.05\%$
test_compile_indexing[int-tensordict-eager] 0.6239ms 18.6999μs 53.4763 KOps/s 52.5995 KOps/s $\color{#35bf28}+1.67\%$
test_compile_indexing[int-tensorclass-compile] 0.2474ms 98.1271μs 10.1909 KOps/s 9.7173 KOps/s $\color{#35bf28}+4.87\%$
test_compile_indexing[int-tensorclass-eager] 0.1989ms 15.6646μs 63.8383 KOps/s 65.7325 KOps/s $\color{#d91a1a}-2.88\%$
test_compile_indexing[int-pytree-compile] 0.2463ms 98.1550μs 10.1880 KOps/s 10.2472 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_indexing[int-pytree-eager] 0.2198ms 15.7022μs 63.6852 KOps/s 65.0437 KOps/s $\color{#d91a1a}-2.09\%$
test_mod_add[eager] 0.2314ms 37.6203μs 26.5814 KOps/s 25.8463 KOps/s $\color{#35bf28}+2.84\%$
test_mod_add[compile] 0.2344ms 81.8085μs 12.2237 KOps/s 12.2126 KOps/s $\color{#35bf28}+0.09\%$
test_mod_add[compile-overhead] 0.3314ms 0.1741ms 5.7439 KOps/s 5.6425 KOps/s $\color{#35bf28}+1.80\%$
test_mod_wrap[eager] 0.4185ms 0.2522ms 3.9656 KOps/s 3.9439 KOps/s $\color{#35bf28}+0.55\%$
test_mod_wrap[compile] 0.4593ms 0.2967ms 3.3699 KOps/s 3.4527 KOps/s $\color{#d91a1a}-2.40\%$
test_mod_wrap[compile-overhead] 7.9356ms 3.8591ms 259.1272 Ops/s 263.8526 Ops/s $\color{#d91a1a}-1.79\%$
test_mod_wrap_and_backward[eager] 1.6895ms 1.5075ms 663.3365 Ops/s 680.6482 Ops/s $\color{#d91a1a}-2.54\%$
test_mod_wrap_and_backward[compile] 1.4562ms 1.2867ms 777.1983 Ops/s 714.2509 Ops/s $\textbf{\color{#35bf28}+8.81\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3627ms 0.9278ms 1.0778 KOps/s 952.7346 Ops/s $\textbf{\color{#35bf28}+13.12\%}$
test_seq_add[eager] 0.3299ms 0.1335ms 7.4914 KOps/s 7.8559 KOps/s $\color{#d91a1a}-4.64\%$
test_seq_add[compile] 0.2444ms 94.4297μs 10.5899 KOps/s 11.0118 KOps/s $\color{#d91a1a}-3.83\%$
test_seq_add[compile-overhead] 0.2695ms 0.1388ms 7.2064 KOps/s 7.5087 KOps/s $\color{#d91a1a}-4.03\%$
test_seq_wrap[eager] 1.0649ms 0.4548ms 2.1989 KOps/s 2.2679 KOps/s $\color{#d91a1a}-3.04\%$
test_seq_wrap[compile] 1.1860ms 0.3218ms 3.1073 KOps/s 3.2232 KOps/s $\color{#d91a1a}-3.59\%$
test_seq_wrap[compile-overhead] 0.3755ms 0.2431ms 4.1134 KOps/s 4.3013 KOps/s $\color{#d91a1a}-4.37\%$
test_func_call_runtime[False-eager] 1.0177ms 0.7935ms 1.2602 KOps/s 1.3276 KOps/s $\textbf{\color{#d91a1a}-5.08\%}$
test_func_call_runtime[False-compile] 0.9543ms 0.7792ms 1.2834 KOps/s 1.3183 KOps/s $\color{#d91a1a}-2.65\%$
test_func_call_runtime[False-compile-overhead] 0.5306ms 0.3775ms 2.6487 KOps/s 2.6951 KOps/s $\color{#d91a1a}-1.72\%$
test_func_call_runtime[True-eager] 1.2967ms 0.9634ms 1.0380 KOps/s 1.0857 KOps/s $\color{#d91a1a}-4.39\%$
test_func_call_runtime[True-compile] 0.9520ms 0.8067ms 1.2395 KOps/s 1.2594 KOps/s $\color{#d91a1a}-1.58\%$
test_func_call_runtime[True-compile-overhead] 0.4597ms 0.3957ms 2.5274 KOps/s 2.5353 KOps/s $\color{#d91a1a}-0.31\%$
test_func_call_cm_runtime[False-eager] 1.0287ms 0.7950ms 1.2579 KOps/s 1.3376 KOps/s $\textbf{\color{#d91a1a}-5.96\%}$
test_func_call_cm_runtime[False-compile] 0.9383ms 0.7804ms 1.2814 KOps/s 1.2960 KOps/s $\color{#d91a1a}-1.13\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5032ms 0.3778ms 2.6468 KOps/s 2.6702 KOps/s $\color{#d91a1a}-0.88\%$
test_func_call_cm_runtime[True-eager] 1.3093ms 1.0410ms 960.6487 Ops/s 961.1572 Ops/s $\color{#d91a1a}-0.05\%$
test_func_call_cm_runtime[True-compile] 1.3253ms 1.0585ms 944.7607 Ops/s 958.8351 Ops/s $\color{#d91a1a}-1.47\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2781ms 1.0674ms 936.8697 Ops/s 968.6433 Ops/s $\color{#d91a1a}-3.28\%$
test_vmap_func_call_cm_runtime[eager] 2.6829ms 2.1635ms 462.2141 Ops/s 464.5760 Ops/s $\color{#d91a1a}-0.51\%$
test_vmap_func_call_cm_runtime[compile] 1.0601ms 0.8583ms 1.1651 KOps/s 1.1942 KOps/s $\color{#d91a1a}-2.43\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5731ms 0.4222ms 2.3685 KOps/s 2.3507 KOps/s $\color{#35bf28}+0.76\%$
test_distributed 2.8125ms 0.2123ms 4.7096 KOps/s 8.5770 KOps/s $\textbf{\color{#d91a1a}-45.09\%}$
test_tdmodule 40.9400μs 20.2778μs 49.3150 KOps/s 50.0460 KOps/s $\color{#d91a1a}-1.46\%$
test_tdmodule_dispatch 59.0200μs 38.0166μs 26.3043 KOps/s 26.1111 KOps/s $\color{#35bf28}+0.74\%$
test_tdseq 38.2010μs 20.2320μs 49.4266 KOps/s 49.8364 KOps/s $\color{#d91a1a}-0.82\%$
test_tdseq_dispatch 59.3700μs 39.0672μs 25.5969 KOps/s 25.4824 KOps/s $\color{#35bf28}+0.45\%$
test_instantiation_functorch 1.6488ms 1.5529ms 643.9721 Ops/s 644.7007 Ops/s $\color{#d91a1a}-0.11\%$
test_exec_functorch 0.2112ms 0.1457ms 6.8622 KOps/s 6.8946 KOps/s $\color{#d91a1a}-0.47\%$
test_exec_functional_call 0.3053ms 0.1382ms 7.2365 KOps/s 7.2120 KOps/s $\color{#35bf28}+0.34\%$
test_exec_td_decorator 0.4135ms 0.1893ms 5.2823 KOps/s 5.2647 KOps/s $\color{#35bf28}+0.33\%$
test_vmap_mlp_speed_decorator[True-True] 0.9020ms 0.6959ms 1.4369 KOps/s 1.4204 KOps/s $\color{#35bf28}+1.16\%$
test_vmap_mlp_speed_decorator[True-False] 0.9057ms 0.6958ms 1.4373 KOps/s 1.4209 KOps/s $\color{#35bf28}+1.15\%$
test_vmap_mlp_speed_decorator[False-True] 0.8167ms 0.6048ms 1.6535 KOps/s 1.6332 KOps/s $\color{#35bf28}+1.24\%$
test_vmap_mlp_speed_decorator[False-False] 0.7780ms 0.6068ms 1.6480 KOps/s 1.6342 KOps/s $\color{#35bf28}+0.84\%$
test_vmap_transformer_speed_decorator[True-True] 20.2841ms 19.5100ms 51.2556 Ops/s 50.7906 Ops/s $\color{#35bf28}+0.92\%$
test_vmap_transformer_speed_decorator[True-False] 20.0241ms 19.4647ms 51.3752 Ops/s 51.3552 Ops/s $\color{#35bf28}+0.04\%$
test_vmap_transformer_speed_decorator[False-True] 20.0348ms 19.3440ms 51.6956 Ops/s 51.1645 Ops/s $\color{#35bf28}+1.04\%$
test_vmap_transformer_speed_decorator[False-False] 19.6795ms 19.3940ms 51.5624 Ops/s 51.7161 Ops/s $\color{#d91a1a}-0.30\%$
test_to_module_speed[True] 1.3353ms 0.9743ms 1.0264 KOps/s 1.0275 KOps/s $\color{#d91a1a}-0.11\%$
test_to_module_speed[False] 1.4388ms 0.9605ms 1.0412 KOps/s 1.0537 KOps/s $\color{#d91a1a}-1.19\%$
test_tc_init 0.1426ms 34.7068μs 28.8128 KOps/s 29.1414 KOps/s $\color{#d91a1a}-1.13\%$
test_tc_init_tensor_only 0.1305ms 10.9457μs 91.3599 KOps/s 93.6059 KOps/s $\color{#d91a1a}-2.40\%$
test_tc_init_nested 0.1653ms 67.7418μs 14.7619 KOps/s 14.3614 KOps/s $\color{#35bf28}+2.79\%$
test_tc_first_layer_tensor 11.6785μs 0.8113μs 1.2326 MOps/s 1.0998 MOps/s $\textbf{\color{#35bf28}+12.08\%}$
test_tc_first_layer_tensor_only 1.6885μs 0.4236μs 2.3605 MOps/s 2.3590 MOps/s $\color{#35bf28}+0.06\%$
test_tc_first_layer_tensor_set 25.8400μs 2.9537μs 338.5619 KOps/s 342.1705 KOps/s $\color{#d91a1a}-1.05\%$
test_tc_first_layer_tensor_only_set 11.4200μs 1.8047μs 554.0959 KOps/s 559.1960 KOps/s $\color{#d91a1a}-0.91\%$
test_tc_first_layer_nontensor 21.1410μs 2.3748μs 421.0929 KOps/s 419.2851 KOps/s $\color{#35bf28}+0.43\%$
test_tc_second_layer_tensor 26.9800μs 1.7419μs 574.0762 KOps/s 568.1031 KOps/s $\color{#35bf28}+1.05\%$
test_tc_second_layer_nontensor 22.8900μs 3.1966μs 312.8343 KOps/s 315.0001 KOps/s $\color{#d91a1a}-0.69\%$
test_unbind 0.2291s 10.7470ms 93.0489 Ops/s 142.2998 Ops/s $\textbf{\color{#d91a1a}-34.61\%}$
test_full_like 7.6944ms 4.4776ms 223.3361 Ops/s 111.0365 Ops/s $\textbf{\color{#35bf28}+101.14\%}$
test_zeros_like 5.2541ms 4.4396ms 225.2440 Ops/s 225.9833 Ops/s $\color{#d91a1a}-0.33\%$
test_ones_like 11.7807ms 5.1140ms 195.5416 Ops/s 224.7032 Ops/s $\textbf{\color{#d91a1a}-12.98\%}$
test_clone 16.6705ms 9.6310ms 103.8319 Ops/s 145.0234 Ops/s $\textbf{\color{#d91a1a}-28.40\%}$
test_squeeze 0.1028ms 9.9978μs 100.0221 KOps/s 101.7889 KOps/s $\color{#d91a1a}-1.74\%$
test_unsqueeze 0.1821ms 77.2211μs 12.9498 KOps/s 13.6157 KOps/s $\color{#d91a1a}-4.89\%$
test_split 0.4309ms 0.1654ms 6.0470 KOps/s 6.0958 KOps/s $\color{#d91a1a}-0.80\%$
test_permute 0.2637ms 0.1797ms 5.5639 KOps/s 5.4704 KOps/s $\color{#35bf28}+1.71\%$
test_stack 54.6495ms 52.2479ms 19.1395 Ops/s 28.4970 Ops/s $\textbf{\color{#d91a1a}-32.84\%}$
test_cat 52.5898ms 51.4835ms 19.4237 Ops/s 19.3401 Ops/s $\color{#35bf28}+0.43\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants