[Versioning] python 3.8 compatibility fix #1127
Merged
vmoens added a commit that referenced this pull request on Dec 4, 2024
ghstack-source-id: b59931a5b6d3062593228cb16fb026438c3c6105
Pull Request resolved: #1127
facebook-github-bot added the CLA Signed label on Dec 4, 2024
vmoens added a commit that referenced this pull request on Dec 4, 2024
ghstack-source-id: ba7e9325c4125892522ee63253c148ed34adac7c
Pull Request resolved: #1127
vmoens added a commit that referenced this pull request on Dec 4, 2024
ghstack-source-id: ba7e9325c4125892522ee63253c148ed34adac7c
Pull Request resolved: #1127
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 44.5740μs | 18.5849μs | 53.8070 KOps/s | 54.1005 KOps/s | |
test_plain_set_stack_nested | 43.8120μs | 18.6885μs | 53.5090 KOps/s | 52.9763 KOps/s | |
test_plain_set_nested_inplace | 85.6890μs | 20.3171μs | 49.2196 KOps/s | 47.9837 KOps/s | |
test_plain_set_stack_nested_inplace | 77.0540μs | 20.3523μs | 49.1345 KOps/s | 49.0546 KOps/s | |
test_items | 20.6690μs | 4.1807μs | 239.1971 KOps/s | 242.3602 KOps/s | |
test_items_nested | 0.7226ms | 0.3986ms | 2.5090 KOps/s | 2.5297 KOps/s | |
test_items_nested_locked | 0.8204ms | 0.3993ms | 2.5043 KOps/s | 2.5177 KOps/s | |
test_items_nested_leaf | 0.1241ms | 71.2917μs | 14.0269 KOps/s | 13.9613 KOps/s | |
test_items_stack_nested | 0.6131ms | 0.3978ms | 2.5136 KOps/s | 2.5200 KOps/s | |
test_items_stack_nested_leaf | 0.1633ms | 74.2696μs | 13.4645 KOps/s | 13.1662 KOps/s | |
test_items_stack_nested_locked | 0.5212ms | 0.4017ms | 2.4892 KOps/s | 2.4858 KOps/s | |
test_keys | 24.6660μs | 3.4675μs | 288.3900 KOps/s | 287.7919 KOps/s | |
test_keys_nested | 0.6707ms | 0.1431ms | 6.9890 KOps/s | 7.3559 KOps/s | |
test_keys_nested_locked | 2.1093ms | 0.1478ms | 6.7662 KOps/s | 7.0405 KOps/s | |
test_keys_nested_leaf | 0.2299ms | 0.1169ms | 8.5533 KOps/s | 8.5642 KOps/s | |
test_keys_stack_nested | 0.2205ms | 0.1356ms | 7.3736 KOps/s | 7.4071 KOps/s | |
test_keys_stack_nested_leaf | 0.2265ms | 0.1174ms | 8.5191 KOps/s | 8.6549 KOps/s | |
test_keys_stack_nested_locked | 0.2769ms | 0.1409ms | 7.0949 KOps/s | 7.0135 KOps/s | |
test_values | 26.3412μs | 1.0758μs | 929.5178 KOps/s | 897.7016 KOps/s | |
test_values_nested | 0.1071ms | 55.9141μs | 17.8846 KOps/s | 18.1807 KOps/s | |
test_values_nested_locked | 0.2971ms | 56.1038μs | 17.8241 KOps/s | 18.0973 KOps/s | |
test_values_nested_leaf | 0.1166ms | 60.5285μs | 16.5212 KOps/s | 16.5675 KOps/s | |
test_values_stack_nested | 0.1072ms | 56.7348μs | 17.6259 KOps/s | 17.7872 KOps/s | |
test_values_stack_nested_leaf | 0.1391ms | 60.3617μs | 16.5668 KOps/s | 15.9750 KOps/s | |
test_values_stack_nested_locked | 0.3346ms | 57.0957μs | 17.5145 KOps/s | 17.2385 KOps/s | |
test_membership | 11.7920μs | 0.8675μs | 1.1528 MOps/s | 1.1660 MOps/s | |
test_membership_nested | 40.7760μs | 2.9257μs | 341.7939 KOps/s | 345.3711 KOps/s | |
test_membership_nested_leaf | 27.8210μs | 3.0197μs | 331.1593 KOps/s | 339.3983 KOps/s | |
test_membership_stacked_nested | 16.9220μs | 2.9158μs | 342.9570 KOps/s | 342.1109 KOps/s | |
test_membership_stacked_nested_leaf | 16.5510μs | 2.9197μs | 342.5042 KOps/s | 339.9051 KOps/s | |
test_membership_nested_last | 34.9750μs | 4.1613μs | 240.3082 KOps/s | 235.4210 KOps/s | |
test_membership_nested_leaf_last | 31.6290μs | 4.1966μs | 238.2862 KOps/s | 229.5091 KOps/s | |
test_membership_stacked_nested_last | 34.8850μs | 6.3817μs | 156.6981 KOps/s | 234.5219 KOps/s | |
test_membership_stacked_nested_leaf_last | 44.3720μs | 6.2443μs | 160.1454 KOps/s | 236.1297 KOps/s | |
test_nested_getleaf | 34.6550μs | 10.7589μs | 92.9465 KOps/s | 91.6146 KOps/s | |
test_nested_get | 66.7850μs | 10.1967μs | 98.0712 KOps/s | 96.2388 KOps/s | |
test_stacked_getleaf | 59.3110μs | 10.6734μs | 93.6905 KOps/s | 93.7308 KOps/s | |
test_stacked_get | 38.9130μs | 10.2306μs | 97.7464 KOps/s | 95.0987 KOps/s | |
test_nested_getitemleaf | 0.2564ms | 11.6034μs | 86.1814 KOps/s | 88.5762 KOps/s | |
test_nested_getitem | 0.1416ms | 10.8975μs | 91.7645 KOps/s | 95.8040 KOps/s | |
test_stacked_getitemleaf | 34.6850μs | 11.1740μs | 89.4937 KOps/s | 89.6071 KOps/s | |
test_stacked_getitem | 69.6220μs | 10.3326μs | 96.7811 KOps/s | 95.8049 KOps/s | |
test_lock_nested | 4.3557ms | 0.4496ms | 2.2242 KOps/s | 2.1831 KOps/s | |
test_lock_stack_nested | 0.7435ms | 0.4102ms | 2.4376 KOps/s | 2.3948 KOps/s | |
test_unlock_nested | 1.1233ms | 0.3674ms | 2.7215 KOps/s | 2.6511 KOps/s | |
test_unlock_stack_nested | 0.5585ms | 0.3254ms | 3.0736 KOps/s | 2.9405 KOps/s | |
test_flatten_speed | 0.5986ms | 95.2225μs | 10.5017 KOps/s | 10.3943 KOps/s | |
test_unflatten_speed | 0.6192ms | 0.4895ms | 2.0429 KOps/s | 2.0230 KOps/s | |
test_common_ops | 3.6704ms | 0.8009ms | 1.2486 KOps/s | 1.2286 KOps/s | |
test_creation | 20.3680μs | 2.0728μs | 482.4357 KOps/s | 497.5153 KOps/s | |
test_creation_empty | 41.8180μs | 12.2298μs | 81.7676 KOps/s | 82.1081 KOps/s | |
test_creation_nested_1 | 58.7690μs | 15.1195μs | 66.1399 KOps/s | 64.9257 KOps/s | |
test_creation_nested_2 | 0.1305ms | 21.1487μs | 47.2842 KOps/s | 47.6513 KOps/s | |
test_clone | 90.3090μs | 12.8881μs | 77.5910 KOps/s | 75.3643 KOps/s | |
test_getitem[int] | 1.2710ms | 12.9437μs | 77.2578 KOps/s | 80.1297 KOps/s | |
test_getitem[slice_int] | 0.1791ms | 24.6092μs | 40.6351 KOps/s | 40.1878 KOps/s | |
test_getitem[range] | 0.2897ms | 49.8539μs | 20.0586 KOps/s | 19.8105 KOps/s | |
test_getitem[tuple] | 0.1452ms | 20.2335μs | 49.4231 KOps/s | 49.3415 KOps/s | |
test_getitem[list] | 0.3595ms | 44.3689μs | 22.5383 KOps/s | 22.2996 KOps/s | |
test_setitem_dim[int] | 78.7470μs | 25.6312μs | 39.0150 KOps/s | 39.1122 KOps/s | |
test_setitem_dim[slice_int] | 0.1348ms | 51.2195μs | 19.5238 KOps/s | 18.5968 KOps/s | |
test_setitem_dim[range] | 0.1166ms | 73.5504μs | 13.5961 KOps/s | 13.3606 KOps/s | |
test_setitem_dim[tuple] | 71.4640μs | 40.8728μs | 24.4662 KOps/s | 23.6727 KOps/s | |
test_setitem | 0.1242ms | 20.7587μs | 48.1725 KOps/s | 47.0215 KOps/s | |
test_set | 0.1137ms | 20.5923μs | 48.5619 KOps/s | 47.2386 KOps/s | |
test_set_shared | 1.3548ms | 0.1708ms | 5.8554 KOps/s | 5.5864 KOps/s | |
test_update | 0.1547ms | 23.8911μs | 41.8566 KOps/s | 42.6568 KOps/s | |
test_update_nested | 1.1011ms | 34.4195μs | 29.0533 KOps/s | 28.7742 KOps/s | |
test_update__nested | 0.1513ms | 32.4329μs | 30.8329 KOps/s | 30.4850 KOps/s | |
test_set_nested | 0.1456ms | 22.9445μs | 43.5834 KOps/s | 44.2709 KOps/s | |
test_set_nested_new | 0.1611ms | 27.0903μs | 36.9136 KOps/s | 35.8039 KOps/s | |
test_select | 0.1419ms | 42.4771μs | 23.5421 KOps/s | 22.2761 KOps/s | |
test_select_nested | 0.4560ms | 61.6673μs | 16.2161 KOps/s | 16.7264 KOps/s | |
test_exclude_nested | 0.1535ms | 79.9095μs | 12.5142 KOps/s | 12.9048 KOps/s | |
test_empty[True] | 0.4670ms | 0.3826ms | 2.6138 KOps/s | 2.6549 KOps/s | |
test_empty[False] | 9.0690μs | 1.2046μs | 830.1813 KOps/s | 812.6427 KOps/s | |
test_unbind_speed | 0.3893ms | 0.2596ms | 3.8515 KOps/s | 3.8126 KOps/s | |
test_unbind_speed_stack0 | 0.5250ms | 0.2566ms | 3.8976 KOps/s | 3.8291 KOps/s | |
test_unbind_speed_stack1 | 0.1084s | 0.7613ms | 1.3136 KOps/s | 1.4237 KOps/s | |
test_split | 2.5048ms | 1.5887ms | 629.4260 Ops/s | 557.9234 Ops/s | |
test_chunk | 0.1120s | 1.9261ms | 519.1776 Ops/s | 558.7965 Ops/s | |
test_consolidate_njt[False-None] | 10.6630ms | 8.2288ms | 121.5242 Ops/s | 116.6861 Ops/s | |
test_creation[device0] | 0.2334ms | 91.3167μs | 10.9509 KOps/s | 10.6104 KOps/s | |
test_creation_from_tensor | 3.3085ms | 95.8635μs | 10.4315 KOps/s | 10.2018 KOps/s | |
test_add_one[memmap_tensor0] | 0.2364ms | 4.7724μs | 209.5400 KOps/s | 207.4388 KOps/s | |
test_contiguous[memmap_tensor0] | 16.0700μs | 0.5127μs | 1.9506 MOps/s | 1.9255 MOps/s | |
test_stack[memmap_tensor0] | 46.9780μs | 3.5935μs | 278.2794 KOps/s | 282.5147 KOps/s | |
test_memmaptd_index | 1.0082ms | 0.2365ms | 4.2280 KOps/s | 4.1874 KOps/s | |
test_memmaptd_index_astensor | 0.5765ms | 0.3111ms | 3.2147 KOps/s | 3.1287 KOps/s | |
test_memmaptd_index_op | 1.0498ms | 0.5919ms | 1.6896 KOps/s | 1.6596 KOps/s | |
test_serialize_model | 0.1214s | 0.1159s | 8.6296 Ops/s | 8.3853 Ops/s | |
test_serialize_model_pickle | 0.4454s | 0.3859s | 2.5913 Ops/s | 2.5339 Ops/s | |
test_serialize_weights | 0.2190s | 0.1268s | 7.8882 Ops/s | 7.2169 Ops/s | |
test_serialize_weights_returnearly | 0.1809s | 0.1582s | 6.3198 Ops/s | 6.2881 Ops/s | |
test_serialize_weights_pickle | 0.5613s | 0.4604s | 2.1722 Ops/s | 2.4635 Ops/s | |
test_serialize_weights_filesystem | 0.1535s | 0.1423s | 7.0289 Ops/s | 6.9098 Ops/s | |
test_serialize_model_filesystem | 0.1556s | 0.1468s | 6.8114 Ops/s | 6.4313 Ops/s | |
test_reshape_pytree | 68.8390μs | 27.2516μs | 36.6951 KOps/s | 37.7531 KOps/s | |
test_reshape_td | 72.4650μs | 33.0727μs | 30.2364 KOps/s | 30.2364 KOps/s | |
test_view_pytree | 60.0130μs | 26.6484μs | 37.5256 KOps/s | 35.6904 KOps/s | |
test_view_td | 85.3390μs | 37.5263μs | 26.6480 KOps/s | 26.5131 KOps/s | |
test_unbind_pytree | 77.1150μs | 29.9878μs | 33.3469 KOps/s | 33.8308 KOps/s | |
test_unbind_td | 0.3240ms | 37.7225μs | 26.5094 KOps/s | 25.7234 KOps/s | |
test_split_pytree | 73.9780μs | 29.5100μs | 33.8869 KOps/s | 34.1553 KOps/s | |
test_split_td | 0.1017s | 53.2989μs | 18.7621 KOps/s | 22.6622 KOps/s | |
test_add_pytree | 0.1054ms | 35.6397μs | 28.0586 KOps/s | 28.0542 KOps/s | |
test_add_td | 0.1265ms | 59.0482μs | 16.9353 KOps/s | 18.4918 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1364ms | 63.4412μs | 15.7626 KOps/s | 15.5229 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5195ms | 0.1655ms | 6.0408 KOps/s | 6.2005 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1061ms | 46.3833μs | 21.5595 KOps/s | 21.5630 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2217ms | 0.1185ms | 8.4365 KOps/s | 8.4273 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 88.9270μs | 25.7728μs | 38.8005 KOps/s | 38.4174 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1028ms | 53.1685μs | 18.8081 KOps/s | 18.4753 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1442ms | 77.5692μs | 12.8917 KOps/s | 12.7307 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1523ms | 65.9976μs | 15.1521 KOps/s | 14.7282 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2369ms | 0.1062ms | 9.4194 KOps/s | 9.6190 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3214ms | 0.1970ms | 5.0760 KOps/s | 5.0852 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1114ms | 45.3770μs | 22.0376 KOps/s | 22.6510 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4853ms | 61.4536μs | 16.2724 KOps/s | 16.2818 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2178ms | 0.1054ms | 9.4841 KOps/s | 9.5222 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3506ms | 0.2000ms | 4.9991 KOps/s | 5.0058 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4361ms | 0.2086ms | 4.7932 KOps/s | 4.7901 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1783ms | 0.1060ms | 9.4373 KOps/s | 9.5990 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1172ms | 55.1597μs | 18.1292 KOps/s | 18.0475 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1028ms | 47.7454μs | 20.9444 KOps/s | 22.1273 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3360ms | 0.1602ms | 6.2416 KOps/s | 6.3525 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2485ms | 0.1069ms | 9.3560 KOps/s | 9.7777 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 80.6700μs | 21.0643μs | 47.4738 KOps/s | 47.7861 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1352ms | 60.3162μs | 16.5793 KOps/s | 17.2212 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1490ms | 80.6929μs | 12.3927 KOps/s | 12.4618 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1244ms | 67.6832μs | 14.7747 KOps/s | 14.5642 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3247ms | 0.2115ms | 4.7287 KOps/s | 4.9393 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.4396ms | 1.2891ms | 775.7465 Ops/s | 771.6264 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3001ms | 0.2065ms | 4.8430 KOps/s | 4.9810 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.8738ms | 0.7773ms | 1.2866 KOps/s | 1.2871 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5776ms | 0.4633ms | 2.1585 KOps/s | 2.2286 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.2840ms | 2.6701ms | 374.5175 Ops/s | 368.7355 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1067ms | 37.2024μs | 26.8800 KOps/s | 27.7460 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6959ms | 33.6023μs | 29.7599 KOps/s | 20.6146 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 89.9580μs | 30.2915μs | 33.0126 KOps/s | 32.9360 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1077ms | 23.7815μs | 42.0494 KOps/s | 43.0419 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1296ms | 31.4687μs | 31.7777 KOps/s | 32.2791 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1890ms | 24.6172μs | 40.6221 KOps/s | 42.2722 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1326ms | 51.5790μs | 19.3877 KOps/s | 19.1011 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5994ms | 19.8568μs | 50.3606 KOps/s | 49.4737 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1501ms | 44.0731μs | 22.6896 KOps/s | 22.7659 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 64.2400μs | 18.8850μs | 52.9521 KOps/s | 53.9675 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1170ms | 45.2780μs | 22.0858 KOps/s | 22.0607 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 68.1170μs | 18.9864μs | 52.6694 KOps/s | 53.0921 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1052ms | 52.5850μs | 19.0168 KOps/s | 18.2448 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0700ms | 19.5023μs | 51.2760 KOps/s | 50.3881 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1342ms | 43.8320μs | 22.8144 KOps/s | 21.6929 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 73.7670μs | 18.6572μs | 53.5987 KOps/s | 53.7210 KOps/s | |
test_compile_indexing[int-pytree-compile] | 97.9430μs | 45.0525μs | 22.1963 KOps/s | 21.9676 KOps/s | |
test_compile_indexing[int-pytree-eager] | 68.0370μs | 18.7029μs | 53.4676 KOps/s | 53.6262 KOps/s | |
test_mod_add[eager] | 95.7390μs | 35.9034μs | 27.8525 KOps/s | 27.9449 KOps/s | |
test_mod_add[compile] | 0.1157ms | 48.7420μs | 20.5162 KOps/s | 20.6276 KOps/s | |
test_mod_add[compile-overhead] | 99.6560μs | 49.0696μs | 20.3792 KOps/s | 20.6923 KOps/s | |
test_mod_wrap[eager] | 0.5368ms | 0.2321ms | 4.3076 KOps/s | 4.4375 KOps/s | |
test_mod_wrap[compile] | 0.4169ms | 0.2147ms | 4.6566 KOps/s | 4.8597 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3173ms | 0.2096ms | 4.7705 KOps/s | 4.8573 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.4372ms | 11.2529ms | 88.8659 Ops/s | 89.9260 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.1758ms | 12.5156ms | 79.9000 Ops/s | 87.5073 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 18.2842ms | 13.0361ms | 76.7101 Ops/s | 76.6244 Ops/s | |
test_seq_add[eager] | 0.3396ms | 0.1161ms | 8.6142 KOps/s | 8.5644 KOps/s | |
test_seq_add[compile] | 0.1194ms | 63.4331μs | 15.7646 KOps/s | 16.4601 KOps/s | |
test_seq_add[compile-overhead] | 0.3202ms | 63.1844μs | 15.8267 KOps/s | 16.5280 KOps/s | |
test_seq_wrap[eager] | 0.6212ms | 0.4479ms | 2.2328 KOps/s | 2.2443 KOps/s | |
test_seq_wrap[compile] | 0.6495ms | 0.2381ms | 4.1995 KOps/s | 4.3516 KOps/s | |
test_seq_wrap[compile-overhead] | 0.5221ms | 0.2319ms | 4.3129 KOps/s | 4.3661 KOps/s | |
test_func_call_runtime[False-eager] | 1.0503ms | 0.5545ms | 1.8035 KOps/s | 1.8315 KOps/s | |
test_func_call_runtime[False-compile] | 0.8818ms | 0.4331ms | 2.3091 KOps/s | 2.3660 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8850ms | 0.4343ms | 2.3026 KOps/s | 2.3694 KOps/s | |
test_func_call_runtime[True-eager] | 0.8877ms | 0.7510ms | 1.3315 KOps/s | 1.3362 KOps/s | |
test_func_call_runtime[True-compile] | 0.7013ms | 0.4739ms | 2.1101 KOps/s | 2.1373 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6532ms | 0.4725ms | 2.1164 KOps/s | 2.1220 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6944ms | 0.5397ms | 1.8528 KOps/s | 1.8246 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8519ms | 0.4316ms | 2.3171 KOps/s | 2.3545 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5886ms | 0.4318ms | 2.3159 KOps/s | 2.3744 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4480ms | 0.9091ms | 1.1000 KOps/s | 1.1315 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7105ms | 0.5002ms | 1.9992 KOps/s | 2.0118 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8257ms | 0.4982ms | 2.0072 KOps/s | 2.0169 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.3760ms | 1.9053ms | 524.8501 Ops/s | 533.6447 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8839ms | 0.5229ms | 1.9125 KOps/s | 1.9169 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8664ms | 0.5299ms | 1.8870 KOps/s | 1.9119 KOps/s | |
test_distributed | 0.2480ms | 0.1261ms | 7.9314 KOps/s | 7.3282 KOps/s | |
test_tdmodule | 81.1520μs | 26.5097μs | 37.7220 KOps/s | 37.3176 KOps/s | |
test_tdmodule_dispatch | 77.8360μs | 48.4609μs | 20.6352 KOps/s | 19.7202 KOps/s | |
test_tdseq | 47.6890μs | 26.7877μs | 37.3305 KOps/s | 37.7595 KOps/s | |
test_tdseq_dispatch | 82.0330μs | 52.5882μs | 19.0157 KOps/s | 18.4661 KOps/s | |
test_instantiation_functorch | 2.3160ms | 1.5518ms | 644.4219 Ops/s | 649.8235 Ops/s | |
test_exec_functorch | 0.3019ms | 0.1851ms | 5.4031 KOps/s | 5.4915 KOps/s | |
test_exec_functional_call | 0.3250ms | 0.1777ms | 5.6265 KOps/s | 5.5798 KOps/s | |
test_exec_td_decorator | 0.4884ms | 0.2311ms | 4.3265 KOps/s | 4.3197 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9745ms | 0.6671ms | 1.4991 KOps/s | 1.5303 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9573ms | 0.6567ms | 1.5229 KOps/s | 1.5472 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8673ms | 0.5275ms | 1.8956 KOps/s | 1.9209 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7662ms | 0.5238ms | 1.9091 KOps/s | 1.9255 KOps/s | |
test_to_module_speed[True] | 1.9318ms | 1.2968ms | 771.1283 Ops/s | 772.2169 Ops/s | |
test_to_module_speed[False] | 1.6669ms | 1.2778ms | 782.5961 Ops/s | 798.1392 Ops/s | |
test_tc_init | 78.8170μs | 46.8373μs | 21.3505 KOps/s | 20.8914 KOps/s | |
test_tc_init_nested | 0.1853ms | 91.5961μs | 10.9175 KOps/s | 10.3477 KOps/s | |
test_tc_first_layer_tensor | 39.1130μs | 1.5537μs | 643.6288 KOps/s | 655.0711 KOps/s | |
test_tc_first_layer_nontensor | 29.3740μs | 4.6914μs | 213.1581 KOps/s | 209.3773 KOps/s | |
test_tc_second_layer_tensor | 52.0980μs | 2.8608μs | 349.5532 KOps/s | 358.6616 KOps/s | |
test_tc_second_layer_nontensor | 50.4140μs | 6.0955μs | 164.0545 KOps/s | 165.7680 KOps/s | |
test_unbind | 0.2344s | 13.4507ms | 74.3457 Ops/s | 75.5177 Ops/s | |
test_full_like | 10.7647ms | 8.4601ms | 118.2022 Ops/s | 126.8764 Ops/s | |
test_zeros_like | 7.3960ms | 3.3672ms | 296.9840 Ops/s | 343.0516 Ops/s | |
test_ones_like | 6.0233ms | 3.8060ms | 262.7429 Ops/s | 156.8629 Ops/s | |
test_clone | 6.2252ms | 5.1592ms | 193.8279 Ops/s | 117.0705 Ops/s | |
test_squeeze | 68.6880μs | 12.5040μs | 79.9746 KOps/s | 85.3968 KOps/s | |
test_unsqueeze | 0.1586ms | 88.3960μs | 11.3127 KOps/s | 11.2215 KOps/s | |
test_split | 0.3957ms | 0.1944ms | 5.1441 KOps/s | 5.2550 KOps/s | |
test_permute | 0.5744ms | 0.2244ms | 4.4571 KOps/s | 4.5789 KOps/s | |
test_stack | 33.2437ms | 25.8809ms | 38.6385 Ops/s | 38.0484 Ops/s | |
test_cat | 32.0759ms | 25.7658ms | 38.8112 Ops/s | 37.3259 Ops/s |
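
The Change column in the table above came through empty. As a rough aid for reading the rows, the sketch below computes a relative throughput difference between the "Ops" and "Ops on Repo HEAD" cells of one row. This is an assumption about what the column is meant to convey, not the benchmark tool's own definition, and `parse_ops` and `relative_change` are hypothetical helper names introduced only for illustration.

```python
import re

# Multipliers for the unit prefixes that appear in the "Ops" columns.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '53.8070 KOps/s' to operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*([KM]?Ops/s)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognized throughput cell: {cell!r}")
    value, unit = match.groups()
    return float(value) * _UNIT_SCALE[unit]


def relative_change(row: str) -> float:
    """Return (Ops - Ops on HEAD) / Ops on HEAD for one table row.

    Assumed definition: a negative value means the PR is slower than HEAD.
    """
    cells = [c.strip() for c in row.split("|")]
    ops, ops_head = parse_ops(cells[3]), parse_ops(cells[4])
    return (ops - ops_head) / ops_head


row = (
    "test_membership_stacked_nested_last | 34.8850μs | 6.3817μs | "
    "156.6981 KOps/s | 234.5219 KOps/s | |"
)
print(f"{relative_change(row):+.1%}")
```

Single-run differences of a few percent are usually within noise for microbenchmarks like these, so only large gaps are likely to be meaningful.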

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.6510μs | 10.9236μs | 91.5453 KOps/s | 96.4125 KOps/s | |
test_plain_set_stack_nested | 36.1200μs | 10.9013μs | 91.7321 KOps/s | 95.8888 KOps/s | |
test_plain_set_nested_inplace | 59.5920μs | 11.7704μs | 84.9591 KOps/s | 88.6062 KOps/s | |
test_plain_set_stack_nested_inplace | 41.3510μs | 11.8792μs | 84.1810 KOps/s | 88.8053 KOps/s | |
test_items | 33.2210μs | 2.8585μs | 349.8287 KOps/s | 338.8499 KOps/s | |
test_items_nested | 0.4474ms | 0.3571ms | 2.8007 KOps/s | 2.8549 KOps/s | |
test_items_nested_locked | 0.3932ms | 0.3552ms | 2.8151 KOps/s | 2.8389 KOps/s | |
test_items_nested_leaf | 91.0220μs | 58.0494μs | 17.2267 KOps/s | 17.1664 KOps/s | |
test_items_stack_nested | 0.4775ms | 0.3570ms | 2.8011 KOps/s | 2.8348 KOps/s | |
test_items_stack_nested_leaf | 92.7420μs | 57.8209μs | 17.2948 KOps/s | 16.8894 KOps/s | |
test_items_stack_nested_locked | 0.4011ms | 0.3556ms | 2.8123 KOps/s | 2.8094 KOps/s | |
test_keys | 31.8700μs | 3.5010μs | 285.6336 KOps/s | 288.3943 KOps/s | |
test_keys_nested | 0.1045ms | 70.0036μs | 14.2850 KOps/s | 14.0960 KOps/s | |
test_keys_nested_locked | 0.7718ms | 76.0813μs | 13.1438 KOps/s | 13.1157 KOps/s | |
test_keys_nested_leaf | 97.8520μs | 61.6643μs | 16.2168 KOps/s | 16.2765 KOps/s | |
test_keys_stack_nested | 0.1115ms | 70.3170μs | 14.2213 KOps/s | 14.1130 KOps/s | |
test_keys_stack_nested_leaf | 0.1004ms | 61.3258μs | 16.3063 KOps/s | 15.9152 KOps/s | |
test_keys_stack_nested_locked | 0.1189ms | 75.5893μs | 13.2294 KOps/s | 13.0306 KOps/s | |
test_values | 5.9452μs | 0.8469μs | 1.1808 MOps/s | 1.1755 MOps/s | |
test_values_nested | 60.1820μs | 31.1131μs | 32.1408 KOps/s | 32.2819 KOps/s | |
test_values_nested_locked | 58.6010μs | 32.7434μs | 30.5405 KOps/s | 30.6097 KOps/s | |
test_values_nested_leaf | 73.4620μs | 33.6024μs | 29.7598 KOps/s | 29.6940 KOps/s | |
test_values_stack_nested | 56.2910μs | 31.3232μs | 31.9252 KOps/s | 31.2868 KOps/s | |
test_values_stack_nested_leaf | 60.9420μs | 33.7606μs | 29.6204 KOps/s | 29.0493 KOps/s | |
test_values_stack_nested_locked | 68.0720μs | 33.1322μs | 30.1821 KOps/s | 29.8655 KOps/s | |
test_membership | 2.0580μs | 0.5100μs | 1.9608 MOps/s | 1.9660 MOps/s | |
test_membership_nested | 20.4205μs | 1.9655μs | 508.7636 KOps/s | 500.9542 KOps/s | |
test_membership_nested_leaf | 19.1005μs | 1.9517μs | 512.3644 KOps/s | 491.9401 KOps/s | |
test_membership_stacked_nested | 26.1700μs | 2.0669μs | 483.8100 KOps/s | 476.9157 KOps/s | |
test_membership_stacked_nested_leaf | 32.1100μs | 2.0530μs | 487.0888 KOps/s | 480.8793 KOps/s | |
test_membership_nested_last | 41.1900μs | 3.3615μs | 297.4877 KOps/s | 339.2930 KOps/s | |
test_membership_nested_leaf_last | 35.5410μs | 2.9630μs | 337.4932 KOps/s | 338.0500 KOps/s | |
test_membership_stacked_nested_last | 26.9210μs | 2.9295μs | 341.3577 KOps/s | 295.5853 KOps/s | |
test_membership_stacked_nested_leaf_last | 26.4200μs | 3.0161μs | 331.5495 KOps/s | 297.0164 KOps/s | |
test_nested_getleaf | 36.4510μs | 6.2174μs | 160.8393 KOps/s | 162.7424 KOps/s | |
test_nested_get | 28.9600μs | 5.9362μs | 168.4584 KOps/s | 171.2693 KOps/s | |
test_stacked_getleaf | 29.9300μs | 6.2584μs | 159.7863 KOps/s | 163.2343 KOps/s | |
test_stacked_get | 0.1012ms | 5.9898μs | 166.9512 KOps/s | 171.4864 KOps/s | |
test_nested_getitemleaf | 75.2910μs | 6.4535μs | 154.9537 KOps/s | 160.8501 KOps/s | |
test_nested_getitem | 42.9010μs | 6.0292μs | 165.8598 KOps/s | 170.5801 KOps/s | |
test_stacked_getitemleaf | 28.2900μs | 6.3255μs | 158.0891 KOps/s | 161.1957 KOps/s | |
test_stacked_getitem | 28.6200μs | 6.0047μs | 166.5366 KOps/s | 169.6416 KOps/s | |
test_lock_nested | 8.8794ms | 0.3820ms | 2.6181 KOps/s | 2.6273 KOps/s | |
test_lock_stack_nested | 0.3839ms | 0.3378ms | 2.9605 KOps/s | 2.9387 KOps/s | |
test_unlock_nested | 0.6518ms | 0.3130ms | 3.1946 KOps/s | 3.1927 KOps/s | |
test_unlock_stack_nested | 0.3171ms | 0.2795ms | 3.5779 KOps/s | 3.5994 KOps/s | |
test_flatten_speed | 0.1085ms | 76.8573μs | 13.0111 KOps/s | 13.4130 KOps/s | |
test_unflatten_speed | 0.3587ms | 0.3111ms | 3.2146 KOps/s | 3.2775 KOps/s | |
test_common_ops | 1.7693ms | 0.6084ms | 1.6435 KOps/s | 1.6750 KOps/s | |
test_creation | 98.7920μs | 1.4738μs | 678.4954 KOps/s | 682.4262 KOps/s | |
test_creation_empty | 31.0310μs | 7.8645μs | 127.1536 KOps/s | 143.6285 KOps/s | |
test_creation_nested_1 | 39.5410μs | 9.3400μs | 107.0668 KOps/s | 118.8723 KOps/s | |
test_creation_nested_2 | 35.7700μs | 11.8776μs | 84.1922 KOps/s | 91.5844 KOps/s | |
test_clone | 50.1710μs | 10.3974μs | 96.1780 KOps/s | 89.4349 KOps/s | |
test_getitem[int] | 91.7955ms | 15.5332μs | 64.3784 KOps/s | 89.1582 KOps/s | |
test_getitem[slice_int] | 0.1067ms | 20.6927μs | 48.3263 KOps/s | 45.2534 KOps/s | |
test_getitem[range] | 0.1411ms | 37.6094μs | 26.5891 KOps/s | 25.1131 KOps/s | |
test_getitem[tuple] | 0.1049ms | 18.1289μs | 55.1605 KOps/s | 51.5144 KOps/s | |
test_getitem[list] | 0.2478ms | 33.1237μs | 30.1899 KOps/s | 28.1421 KOps/s | |
test_setitem_dim[int] | 38.3210μs | 18.8621μs | 53.0164 KOps/s | 51.1363 KOps/s | |
test_setitem_dim[slice_int] | 59.5910μs | 38.3799μs | 26.0553 KOps/s | 25.2331 KOps/s | |
test_setitem_dim[range] | 76.0820μs | 53.8240μs | 18.5791 KOps/s | 18.0086 KOps/s | |
test_setitem_dim[tuple] | 53.0910μs | 32.7851μs | 30.5016 KOps/s | 29.5712 KOps/s | |
test_setitem | 83.0920μs | 14.6621μs | 68.2029 KOps/s | 63.7553 KOps/s | |
test_set | 75.7020μs | 14.2175μs | 70.3359 KOps/s | 65.6979 KOps/s | |
test_set_shared | 1.5815ms | 0.1471ms | 6.7993 KOps/s | 6.7372 KOps/s | |
test_update | 0.2678ms | 17.0207μs | 58.7522 KOps/s | 57.1006 KOps/s | |
test_update_nested | 76.3410μs | 21.9656μs | 45.5257 KOps/s | 43.4126 KOps/s | |
test_update__nested | 1.0441ms | 24.5400μs | 40.7497 KOps/s | 39.3447 KOps/s | |
test_set_nested | 79.3820μs | 15.3208μs | 65.2707 KOps/s | 60.3033 KOps/s | |
test_set_nested_new | 86.4420μs | 17.7360μs | 56.3825 KOps/s | 46.6023 KOps/s | |
test_select | 89.4820μs | 29.8086μs | 33.5474 KOps/s | 29.6809 KOps/s | |
test_select_nested | 65.7510μs | 41.5317μs | 24.0780 KOps/s | 23.8821 KOps/s | |
test_exclude_nested | 89.8520μs | 61.9990μs | 16.1293 KOps/s | 16.3491 KOps/s | |
test_empty[True] | 0.3035ms | 0.2774ms | 3.6046 KOps/s | 3.6477 KOps/s | |
test_empty[False] | 3.8961μs | 0.7459μs | 1.3407 MOps/s | 1.3478 MOps/s | |
test_to | 88.8810μs | 55.7797μs | 17.9277 KOps/s | 16.9607 KOps/s | |
test_to_nonblocking | 0.1011ms | 45.6197μs | 21.9204 KOps/s | 21.1427 KOps/s | |
test_unbind_speed | 1.4674ms | 0.2321ms | 4.3078 KOps/s | 4.2742 KOps/s | |
test_unbind_speed_stack0 | 0.2652ms | 0.2327ms | 4.2981 KOps/s | 4.2779 KOps/s | |
test_unbind_speed_stack1 | 93.3499ms | 0.6514ms | 1.5351 KOps/s | 1.5387 KOps/s | |
test_split | 93.6369ms | 1.7158ms | 582.8306 Ops/s | 612.3013 Ops/s | |
test_chunk | 94.8771ms | 1.5975ms | 625.9860 Ops/s | 607.5958 Ops/s | |
test_consolidate[False-None] | 2.7123ms | 2.6515ms | 377.1496 Ops/s | 344.3569 Ops/s | |
test_consolidate[default-None] | 1.8037ms | 1.6856ms | 593.2476 Ops/s | 575.7367 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8419ms | 1.7174ms | 582.2699 Ops/s | 566.8801 Ops/s | |
test_consolidate_njt[False-None] | 6.7285ms | 6.5744ms | 152.1052 Ops/s | 147.3508 Ops/s | |
test_to[False-False-None] | 1.8362ms | 1.7559ms | 569.4935 Ops/s | 579.4321 Ops/s | |
test_to[True-False-None] | 1.5284ms | 1.3000ms | 769.2489 Ops/s | 739.4569 Ops/s | |
test_to[within-False-None] | 4.4122ms | 4.1010ms | 243.8451 Ops/s | 241.7887 Ops/s | |
test_to[True-default-None] | 5.3672ms | 5.1981ms | 192.3776 Ops/s | 185.3742 Ops/s | |
test_to_njt[False-False-None] | 7.2287ms | 7.0579ms | 141.6848 Ops/s | 139.6915 Ops/s | |
test_to_njt[True-False-None] | 5.6859ms | 5.5733ms | 179.4265 Ops/s | 173.8803 Ops/s | |
test_to_njt[within-False-None] | 12.4311ms | 12.2987ms | 81.3096 Ops/s | 78.9137 Ops/s | |
test_creation[device0] | 0.4503ms | 80.2933μs | 12.4543 KOps/s | 11.9941 KOps/s | |
test_creation_from_tensor | 0.4924ms | 83.1314μs | 12.0291 KOps/s | 11.7746 KOps/s | |
test_add_one[memmap_tensor0] | 0.2269ms | 7.0049μs | 142.7565 KOps/s | 140.1507 KOps/s | |
test_contiguous[memmap_tensor0] | 1.7946μs | 0.4123μs | 2.4256 MOps/s | 2.3963 MOps/s | |
test_stack[memmap_tensor0] | 38.9910μs | 4.4475μs | 224.8439 KOps/s | 208.2513 KOps/s | |
test_memmaptd_index | 1.5280ms | 0.2558ms | 3.9096 KOps/s | 3.8684 KOps/s | |
test_memmaptd_index_astensor | 0.6138ms | 0.3131ms | 3.1937 KOps/s | 3.2000 KOps/s | |
test_memmaptd_index_op | 1.0498ms | 0.6043ms | 1.6549 KOps/s | 1.6992 KOps/s | |
test_serialize_model | 0.1309s | 0.1301s | 7.6842 Ops/s | 7.6845 Ops/s | |
test_serialize_model_pickle | 1.3469s | 1.2122s | 0.8250 Ops/s | 0.8256 Ops/s | |
test_serialize_weights | 0.1307s | 0.1297s | 7.7078 Ops/s | 7.7433 Ops/s | |
test_serialize_weights_returnearly | 0.6094s | 66.1437ms | 15.1186 Ops/s | 14.5139 Ops/s | |
test_serialize_weights_pickle | 1.3475s | 1.1848s | 0.8440 Ops/s | 0.8382 Ops/s | |
test_reshape_pytree | 51.6410μs | 22.4914μs | 44.4614 KOps/s | 43.6855 KOps/s | |
test_reshape_td | 58.1010μs | 27.1978μs | 36.7677 KOps/s | 36.0648 KOps/s | |
test_view_pytree | 56.8510μs | 22.5699μs | 44.3068 KOps/s | 44.1909 KOps/s | |
test_view_td | 66.7810μs | 30.0467μs | 33.2815 KOps/s | 31.2316 KOps/s | |
test_unbind_pytree | 57.1810μs | 28.2395μs | 35.4114 KOps/s | 35.0199 KOps/s | |
test_unbind_td | 0.7455ms | 36.1225μs | 27.6836 KOps/s | 27.2387 KOps/s | |
test_split_pytree | 57.2610μs | 29.9605μs | 33.3773 KOps/s | 32.0242 KOps/s | |
test_split_td | 0.9511ms | 38.9699μs | 25.6608 KOps/s | 24.5917 KOps/s | |
test_add_pytree | 61.3210μs | 35.3699μs | 28.2726 KOps/s | 27.8431 KOps/s | |
test_add_td | 79.3820μs | 47.2286μs | 21.1736 KOps/s | 21.6484 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1731ms | 0.1210ms | 8.2626 KOps/s | 7.9441 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2240ms | 0.1259ms | 7.9401 KOps/s | 7.7944 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1337ms | 97.0072μs | 10.3085 KOps/s | 10.0355 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 2.1413ms | 0.1510ms | 6.6239 KOps/s | 6.5248 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 54.9010μs | 22.9002μs | 43.6677 KOps/s | 44.1255 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 57.0210μs | 27.0809μs | 36.9264 KOps/s | 36.8101 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1125ms | 64.9945μs | 15.3859 KOps/s | 15.1319 KOps/s | |
test_compile_copy_nested[pytree-eager] | 78.9210μs | 49.3220μs | 20.2749 KOps/s | 20.0085 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1839ms | 0.1442ms | 6.9336 KOps/s | 6.8638 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3059ms | 0.2107ms | 4.7468 KOps/s | 4.8505 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1377ms | 0.1004ms | 9.9570 KOps/s | 10.0396 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1163ms | 53.4990μs | 18.6920 KOps/s | 18.5612 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1825ms | 0.1378ms | 7.2562 KOps/s | 7.2595 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5562ms | 0.4904ms | 2.0390 KOps/s | 2.0125 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3673ms | 0.2504ms | 3.9932 KOps/s | 4.0022 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1867ms | 0.1459ms | 6.8539 KOps/s | 6.6971 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1447ms | 63.4521μs | 15.7599 KOps/s | 15.7533 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2369ms | 0.1042ms | 9.5924 KOps/s | 10.0834 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5874ms | 0.4217ms | 2.3716 KOps/s | 2.3828 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2581ms | 0.1468ms | 6.8133 KOps/s | 7.2881 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1268ms | 19.9056μs | 50.2372 KOps/s | 53.8933 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1129ms | 26.6657μs | 37.5013 KOps/s | 36.9076 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1433ms | 69.3768μs | 14.4140 KOps/s | 14.1487 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1368ms | 50.6691μs | 19.7359 KOps/s | 19.2621 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6470ms | 0.3953ms | 2.5300 KOps/s | 2.1976 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7143ms | 2.6479ms | 377.6524 Ops/s | 370.1032 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6492ms | 0.4447ms | 2.2488 KOps/s | 2.2282 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7596ms | 2.7005ms | 370.3036 Ops/s | 363.5488 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2660ms | 0.1219ms | 8.2024 KOps/s | 8.3066 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5521ms | 79.7277μs | 12.5427 KOps/s | 12.0229 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.6846ms | 0.1147ms | 8.7176 KOps/s | 9.3619 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1270ms | 73.8341μs | 13.5439 KOps/s | 14.3230 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.3096ms | 0.1163ms | 8.5967 KOps/s | 9.2857 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1792ms | 74.0720μs | 13.5004 KOps/s | 14.3995 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1598ms | 0.1050ms | 9.5205 KOps/s | 9.4527 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1480ms | 17.7178μs | 56.4403 KOps/s | 53.1103 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1350ms | 97.1268μs | 10.2958 KOps/s | 10.1438 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 52.1110μs | 15.9124μs | 62.8440 KOps/s | 60.2333 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1509ms | 0.1014ms | 9.8618 KOps/s | 10.0784 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 60.2910μs | 15.8182μs | 63.2184 KOps/s | 60.6951 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1481ms | 0.1039ms | 9.6278 KOps/s | 9.6025 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5608ms | 17.4891μs | 57.1784 KOps/s | 53.2374 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1466ms | 98.3315μs | 10.1697 KOps/s | 10.0728 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 44.6910μs | 15.9960μs | 62.5157 KOps/s | 61.2090 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1395ms | 0.1023ms | 9.7795 KOps/s | 10.1117 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1281ms | 16.9555μs | 58.9780 KOps/s | 61.1677 KOps/s | |
test_mod_add[eager] | 83.8020μs | 39.5854μs | 25.2618 KOps/s | 26.4372 KOps/s | |
test_mod_add[compile] | 0.1198ms | 80.8431μs | 12.3696 KOps/s | 12.0210 KOps/s | |
test_mod_add[compile-overhead] | 0.3241ms | 0.1673ms | 5.9757 KOps/s | 5.6371 KOps/s | |
test_mod_wrap[eager] | 0.3475ms | 0.2559ms | 3.9071 KOps/s | 3.8894 KOps/s | |
test_mod_wrap[compile] | 0.8058ms | 0.2875ms | 3.4786 KOps/s | 3.4189 KOps/s | |
test_mod_wrap[compile-overhead] | 7.2105ms | 3.8110ms | 262.3961 Ops/s | 264.0811 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5111ms | 1.3957ms | 716.4813 Ops/s | 667.6135 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3970ms | 1.2768ms | 783.2225 Ops/s | 715.1015 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3751ms | 0.9258ms | 1.0801 KOps/s | 962.8859 Ops/s | |
test_seq_add[eager] | 0.1654ms | 0.1139ms | 8.7770 KOps/s | 8.7420 KOps/s | |
test_seq_add[compile] | 0.1420ms | 88.4623μs | 11.3043 KOps/s | 10.9937 KOps/s | |
test_seq_add[compile-overhead] | 0.1820ms | 0.1290ms | 7.7548 KOps/s | 7.5946 KOps/s | |
test_seq_wrap[eager] | 0.5029ms | 0.4343ms | 2.3024 KOps/s | 2.3623 KOps/s | |
test_seq_wrap[compile] | 0.4234ms | 0.3072ms | 3.2555 KOps/s | 3.2390 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2808ms | 0.2233ms | 4.4780 KOps/s | 4.3801 KOps/s | |
test_func_call_runtime[False-eager] | 0.8760ms | 0.7746ms | 1.2910 KOps/s | 1.2998 KOps/s | |
test_func_call_runtime[False-compile] | 0.8529ms | 0.7601ms | 1.3156 KOps/s | 1.3104 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4358ms | 0.3648ms | 2.7413 KOps/s | 2.7192 KOps/s | |
test_func_call_runtime[True-eager] | 1.0745ms | 0.9236ms | 1.0828 KOps/s | 1.0744 KOps/s | |
test_func_call_runtime[True-compile] | 0.8119ms | 0.7699ms | 1.2989 KOps/s | 1.2733 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4964ms | 0.3853ms | 2.5954 KOps/s | 2.5797 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8245ms | 0.7521ms | 1.3296 KOps/s | 1.3117 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8053ms | 0.7503ms | 1.3328 KOps/s | 1.3031 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4286ms | 0.3664ms | 2.7295 KOps/s | 2.7024 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1211ms | 1.0279ms | 972.8375 Ops/s | 968.7507 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9102ms | 0.7996ms | 1.2506 KOps/s | 1.2232 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4743ms | 0.4116ms | 2.4298 KOps/s | 2.4094 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5729ms | 2.1029ms | 475.5312 Ops/s | 474.5265 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8805ms | 0.8113ms | 1.2326 KOps/s | 1.2034 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4868ms | 0.4118ms | 2.4282 KOps/s | 2.3802 KOps/s | |
test_distributed | 4.0609ms | 0.3173ms | 3.1519 KOps/s | 8.4837 KOps/s | |
test_tdmodule | 57.4710μs | 19.0081μs | 52.6092 KOps/s | 52.7672 KOps/s | |
test_tdmodule_dispatch | 84.8120μs | 34.5397μs | 28.9522 KOps/s | 29.7152 KOps/s | |
test_tdseq | 38.8810μs | 19.1828μs | 52.1300 KOps/s | 54.0071 KOps/s | |
test_tdseq_dispatch | 58.9510μs | 37.0301μs | 27.0050 KOps/s | 28.1755 KOps/s | |
test_instantiation_functorch | 1.6204ms | 1.5255ms | 655.5436 Ops/s | 633.2222 Ops/s | |
test_exec_functorch | 0.1980ms | 0.1434ms | 6.9749 KOps/s | 6.6983 KOps/s | |
test_exec_functional_call | 0.1681ms | 0.1392ms | 7.1813 KOps/s | 6.9568 KOps/s | |
test_exec_td_decorator | 0.3650ms | 0.1841ms | 5.4319 KOps/s | 5.3385 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8189ms | 0.6921ms | 1.4449 KOps/s | 1.4555 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8535ms | 0.6915ms | 1.4461 KOps/s | 1.4544 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7176ms | 0.5994ms | 1.6684 KOps/s | 1.6676 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7219ms | 0.6004ms | 1.6655 KOps/s | 1.6631 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.3975ms | 19.3310ms | 51.7305 Ops/s | 51.8756 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.0542ms | 19.4037ms | 51.5367 Ops/s | 51.8703 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.3583ms | 19.2732ms | 51.8854 Ops/s | 52.2212 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.3063ms | 19.2409ms | 51.9727 Ops/s | 52.2255 Ops/s | |
test_to_module_speed[True] | 1.0324ms | 0.9356ms | 1.0689 KOps/s | 1.0647 KOps/s | |
test_to_module_speed[False] | 1.3083ms | 0.9308ms | 1.0743 KOps/s | 1.0945 KOps/s | |
test_tc_init | 70.3510μs | 36.7201μs | 27.2330 KOps/s | 28.7852 KOps/s | |
test_tc_init_nested | 0.2097ms | 74.3437μs | 13.4510 KOps/s | 13.9384 KOps/s | |
test_tc_first_layer_tensor | 5.0744μs | 0.6964μs | 1.4359 MOps/s | 1.4385 MOps/s | |
test_tc_first_layer_nontensor | 22.0200μs | 2.3128μs | 432.3768 KOps/s | 430.0189 KOps/s | |
test_tc_second_layer_tensor | 11.6570μs | 1.4443μs | 692.3877 KOps/s | 712.0949 KOps/s | |
test_tc_second_layer_nontensor | 26.0300μs | 3.0778μs | 324.9055 KOps/s | 327.6916 KOps/s | |
test_unbind | 0.2237s | 9.7359ms | 102.7123 Ops/s | 148.0410 Ops/s | |
test_full_like | 12.1973ms | 9.1929ms | 108.7791 Ops/s | 105.5583 Ops/s | |
test_zeros_like | 5.3271ms | 4.2142ms | 237.2948 Ops/s | 113.6337 Ops/s | |
test_ones_like | 4.9370ms | 4.2756ms | 233.8828 Ops/s | 232.2611 Ops/s | |
test_clone | 11.4001ms | 9.0161ms | 110.9128 Ops/s | 158.7720 Ops/s | |
test_squeeze | 92.6210μs | 9.5104μs | 105.1476 KOps/s | 104.3574 KOps/s | |
test_unsqueeze | 0.1200ms | 71.0279μs | 14.0790 KOps/s | 13.5958 KOps/s | |
test_split | 0.3831ms | 0.1588ms | 6.2980 KOps/s | 5.9449 KOps/s | |
test_permute | 0.3032ms | 0.1776ms | 5.6322 KOps/s | 5.5119 KOps/s | |
test_stack | 50.6048ms | 50.3333ms | 19.8676 Ops/s | 18.9092 Ops/s | |
test_cat | 51.5313ms | 50.2388ms | 19.9049 Ops/s | 19.8153 Ops/s |
Stack from ghstack (oldest at bottom):