[Feature] Add missing __torch_function__
#1169
Merged
facebook-github-bot added the CLA Signed label on Jan 9, 2025
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 53.8000μs | 20.7739μs | 48.1374 KOps/s | 49.7424 KOps/s | |
test_plain_set_stack_nested | 68.5770μs | 20.8621μs | 47.9338 KOps/s | 49.7396 KOps/s | |
test_plain_set_nested_inplace | 56.8860μs | 22.8246μs | 43.8123 KOps/s | 45.5541 KOps/s | |
test_plain_set_stack_nested_inplace | 61.7040μs | 22.6793μs | 44.0931 KOps/s | 44.7143 KOps/s | |
test_items | 37.7200μs | 4.3245μs | 231.2389 KOps/s | 236.8023 KOps/s | |
test_items_nested | 0.6921ms | 0.3933ms | 2.5424 KOps/s | 2.4938 KOps/s | |
test_items_nested_locked | 0.5134ms | 0.3924ms | 2.5487 KOps/s | 2.4946 KOps/s | |
test_items_nested_leaf | 0.1595ms | 78.3559μs | 12.7623 KOps/s | 12.7155 KOps/s | |
test_items_stack_nested | 0.5635ms | 0.3929ms | 2.5454 KOps/s | 2.4747 KOps/s | |
test_items_stack_nested_leaf | 0.1856ms | 80.2289μs | 12.4643 KOps/s | 12.3833 KOps/s | |
test_items_stack_nested_locked | 0.5747ms | 0.3922ms | 2.5495 KOps/s | 2.4808 KOps/s | |
test_keys | 29.2840μs | 3.4484μs | 289.9894 KOps/s | 274.2992 KOps/s | |
test_keys_nested | 0.2982ms | 0.1625ms | 6.1548 KOps/s | 6.0843 KOps/s | |
test_keys_nested_locked | 1.6589ms | 0.1683ms | 5.9418 KOps/s | 5.8799 KOps/s | |
test_keys_nested_leaf | 0.2596ms | 0.1428ms | 7.0030 KOps/s | 6.8904 KOps/s | |
test_keys_stack_nested | 0.2650ms | 0.1582ms | 6.3221 KOps/s | 6.2590 KOps/s | |
test_keys_stack_nested_leaf | 0.2958ms | 0.1385ms | 7.2182 KOps/s | 7.1928 KOps/s | |
test_keys_stack_nested_locked | 0.2633ms | 0.1634ms | 6.1185 KOps/s | 6.0273 KOps/s | |
test_values | 7.6322μs | 1.0332μs | 967.8645 KOps/s | 963.8857 KOps/s | |
test_values_nested | 0.1077ms | 61.5967μs | 16.2346 KOps/s | 16.5157 KOps/s | |
test_values_nested_locked | 0.1089ms | 61.2888μs | 16.3162 KOps/s | 16.4716 KOps/s | |
test_values_nested_leaf | 0.1351ms | 70.7995μs | 14.1244 KOps/s | 13.8523 KOps/s | |
test_values_stack_nested | 0.1175ms | 62.4120μs | 16.0226 KOps/s | 15.7842 KOps/s | |
test_values_stack_nested_leaf | 0.1828ms | 69.8309μs | 14.3203 KOps/s | 14.4109 KOps/s | |
test_values_stack_nested_locked | 0.1220ms | 62.4736μs | 16.0068 KOps/s | 15.8973 KOps/s | |
test_membership | 39.3430μs | 0.8702μs | 1.1491 MOps/s | 1.1835 MOps/s | |
test_membership_nested | 31.3780μs | 2.9512μs | 338.8498 KOps/s | 346.5363 KOps/s | |
test_membership_nested_leaf | 39.0320μs | 2.9639μs | 337.3886 KOps/s | 341.4360 KOps/s | |
test_membership_stacked_nested | 32.6010μs | 2.9523μs | 338.7199 KOps/s | 348.7616 KOps/s | |
test_membership_stacked_nested_leaf | 23.0230μs | 2.9200μs | 342.4649 KOps/s | 349.7886 KOps/s | |
test_membership_nested_last | 39.1630μs | 4.3536μs | 229.6934 KOps/s | 230.3081 KOps/s | |
test_membership_nested_leaf_last | 35.7360μs | 4.3965μs | 227.4551 KOps/s | 230.0698 KOps/s | |
test_membership_stacked_nested_last | 39.9640μs | 13.4861μs | 74.1506 KOps/s | 228.7645 KOps/s | |
test_membership_stacked_nested_leaf_last | 48.5300μs | 13.4808μs | 74.1795 KOps/s | 211.9089 KOps/s | |
test_nested_getleaf | 55.2530μs | 10.5379μs | 94.8955 KOps/s | 93.6903 KOps/s | |
test_nested_get | 54.7020μs | 10.2049μs | 97.9926 KOps/s | 101.3345 KOps/s | |
test_stacked_getleaf | 48.2900μs | 10.4828μs | 95.3942 KOps/s | 94.9327 KOps/s | |
test_stacked_get | 45.2240μs | 10.0224μs | 99.7762 KOps/s | 100.9444 KOps/s | |
test_nested_getitemleaf | 49.5220μs | 11.2493μs | 88.8944 KOps/s | 89.9376 KOps/s | |
test_nested_getitem | 50.8640μs | 10.5103μs | 95.1446 KOps/s | 95.2042 KOps/s | |
test_stacked_getitemleaf | 54.3910μs | 10.8197μs | 92.4242 KOps/s | 91.2960 KOps/s | |
test_stacked_getitem | 41.0160μs | 10.3428μs | 96.6852 KOps/s | 97.3567 KOps/s | |
test_lock_nested | 1.9791ms | 0.4557ms | 2.1943 KOps/s | 1.7857 KOps/s | |
test_lock_stack_nested | 0.7108ms | 0.4095ms | 2.4419 KOps/s | 2.3806 KOps/s | |
test_unlock_nested | 0.8556ms | 0.3756ms | 2.6623 KOps/s | 2.6499 KOps/s | |
test_unlock_stack_nested | 0.5473ms | 0.3310ms | 3.0216 KOps/s | 2.9883 KOps/s | |
test_flatten_speed | 0.1803ms | 99.9412μs | 10.0059 KOps/s | 9.9278 KOps/s | |
test_unflatten_speed | 0.9143ms | 0.5270ms | 1.8976 KOps/s | 1.9148 KOps/s | |
test_common_ops | 3.7463ms | 0.8182ms | 1.2223 KOps/s | 1.2962 KOps/s | |
test_creation | 21.8800μs | 2.5089μs | 398.5801 KOps/s | 412.8699 KOps/s | |
test_creation_empty | 63.7290μs | 12.0638μs | 82.8927 KOps/s | 92.4108 KOps/s | |
test_creation_nested_1 | 64.7710μs | 14.8798μs | 67.2054 KOps/s | 71.7682 KOps/s | |
test_creation_nested_2 | 57.0260μs | 19.5417μs | 51.1726 KOps/s | 54.3036 KOps/s | |
test_clone | 0.1493ms | 13.5620μs | 73.7352 KOps/s | 75.5527 KOps/s | |
test_getitem[int] | 1.2876ms | 13.1642μs | 75.9637 KOps/s | 77.2159 KOps/s | |
test_getitem[slice_int] | 0.1523ms | 24.4376μs | 40.9205 KOps/s | 38.5023 KOps/s | |
test_getitem[range] | 0.1962ms | 49.3625μs | 20.2583 KOps/s | 20.5451 KOps/s | |
test_getitem[tuple] | 0.1385ms | 20.3687μs | 49.0950 KOps/s | 49.7488 KOps/s | |
test_getitem[list] | 0.4314ms | 44.6295μs | 22.4067 KOps/s | 23.1564 KOps/s | |
test_setitem_dim[int] | 61.9150μs | 24.8426μs | 40.2534 KOps/s | 42.2155 KOps/s | |
test_setitem_dim[slice_int] | 0.1185ms | 53.2031μs | 18.7959 KOps/s | 20.2286 KOps/s | |
test_setitem_dim[range] | 0.1170ms | 75.2769μs | 13.2843 KOps/s | 14.0284 KOps/s | |
test_setitem_dim[tuple] | 82.5740μs | 40.1773μs | 24.8897 KOps/s | 25.4961 KOps/s | |
test_setitem | 0.2059ms | 21.1078μs | 47.3759 KOps/s | 50.9039 KOps/s | |
test_set | 0.1475ms | 20.2109μs | 49.4782 KOps/s | 52.1267 KOps/s | |
test_set_shared | 1.1855ms | 0.1710ms | 5.8493 KOps/s | 5.8785 KOps/s | |
test_update | 0.2145ms | 23.4218μs | 42.6953 KOps/s | 46.3485 KOps/s | |
test_update_nested | 0.1738ms | 33.8584μs | 29.5347 KOps/s | 32.0270 KOps/s | |
test_update__nested | 0.4729ms | 34.1753μs | 29.2609 KOps/s | 29.1592 KOps/s | |
test_set_nested | 0.1512ms | 22.7262μs | 44.0021 KOps/s | 46.4416 KOps/s | |
test_set_nested_new | 0.1924ms | 27.4273μs | 36.4600 KOps/s | 38.4991 KOps/s | |
test_select | 0.1719ms | 44.1160μs | 22.6675 KOps/s | 23.7116 KOps/s | |
test_select_nested | 0.1324ms | 64.0812μs | 15.6052 KOps/s | 15.7166 KOps/s | |
test_exclude_nested | 0.2033ms | 83.4915μs | 11.9773 KOps/s | 12.1414 KOps/s | |
test_empty[True] | 0.6246ms | 0.4088ms | 2.4462 KOps/s | 2.4282 KOps/s | |
test_empty[False] | 12.8465μs | 1.3776μs | 725.8800 KOps/s | 742.3309 KOps/s | |
test_unbind_speed | 0.4215ms | 0.2697ms | 3.7077 KOps/s | 3.7353 KOps/s | |
test_unbind_speed_stack0 | 0.5271ms | 0.2599ms | 3.8475 KOps/s | 3.8501 KOps/s | |
test_unbind_speed_stack1 | 0.1068s | 0.7804ms | 1.2814 KOps/s | 1.3963 KOps/s | |
test_split | 0.1061s | 1.7960ms | 556.7794 Ops/s | 563.5085 Ops/s | |
test_chunk | 1.8110ms | 1.6136ms | 619.7180 Ops/s | 568.6896 Ops/s | |
test_consolidate_njt[False-None] | 0.1207s | 9.1834ms | 108.8923 Ops/s | 124.0297 Ops/s | |
test_creation[device0] | 0.2849ms | 92.3292μs | 10.8308 KOps/s | 11.1260 KOps/s | |
test_creation_from_tensor | 3.9012ms | 95.4288μs | 10.4790 KOps/s | 10.8841 KOps/s | |
test_add_one[memmap_tensor0] | 0.2250ms | 5.1722μs | 193.3402 KOps/s | 206.6561 KOps/s | |
test_contiguous[memmap_tensor0] | 23.6340μs | 0.5260μs | 1.9012 MOps/s | 1.8672 MOps/s | |
test_stack[memmap_tensor0] | 47.9300μs | 3.4649μs | 288.6117 KOps/s | 289.1816 KOps/s | |
test_memmaptd_index | 1.0972ms | 0.2397ms | 4.1723 KOps/s | 4.2399 KOps/s | |
test_memmaptd_index_astensor | 1.0611ms | 0.3254ms | 3.0727 KOps/s | 3.0928 KOps/s | |
test_memmaptd_index_op | 1.0816ms | 0.6009ms | 1.6640 KOps/s | 1.7365 KOps/s | |
test_serialize_model | 0.1288s | 0.1182s | 8.4574 Ops/s | 8.4890 Ops/s | |
test_serialize_model_pickle | 0.4966s | 0.4007s | 2.4956 Ops/s | 2.4912 Ops/s | |
test_serialize_weights | 0.1267s | 0.1163s | 8.6015 Ops/s | 8.3597 Ops/s | |
test_serialize_weights_returnearly | 0.2740s | 0.1760s | 5.6806 Ops/s | 6.2810 Ops/s | |
test_serialize_weights_pickle | 1.1618s | 0.7406s | 1.3502 Ops/s | 2.5543 Ops/s | |
test_serialize_weights_filesystem | 0.1517s | 0.1444s | 6.9232 Ops/s | 7.0670 Ops/s | |
test_serialize_model_filesystem | 0.1588s | 0.1445s | 6.9187 Ops/s | 6.7235 Ops/s | |
test_reshape_pytree | 61.1540μs | 26.3337μs | 37.9742 KOps/s | 37.9883 KOps/s | |
test_reshape_td | 69.8800μs | 32.5236μs | 30.7469 KOps/s | 30.5430 KOps/s | |
test_view_pytree | 77.5050μs | 26.2409μs | 38.1085 KOps/s | 37.9296 KOps/s | |
test_view_td | 0.1178ms | 37.4172μs | 26.7257 KOps/s | 25.7272 KOps/s | |
test_unbind_pytree | 65.0110μs | 29.5438μs | 33.8480 KOps/s | 34.0477 KOps/s | |
test_unbind_td | 0.3031ms | 40.6402μs | 24.6062 KOps/s | 25.5758 KOps/s | |
test_split_pytree | 72.8160μs | 29.1169μs | 34.3443 KOps/s | 34.0882 KOps/s | |
test_split_td | 0.4518ms | 45.5556μs | 21.9512 KOps/s | 22.2593 KOps/s | |
test_add_pytree | 0.1301ms | 35.2958μs | 28.3320 KOps/s | 28.8523 KOps/s | |
test_add_td | 0.1343ms | 57.2359μs | 17.4716 KOps/s | 18.6527 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1237ms | 63.2471μs | 15.8110 KOps/s | 15.4927 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4067ms | 0.1722ms | 5.8084 KOps/s | 5.7794 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1198ms | 46.2331μs | 21.6295 KOps/s | 21.3947 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.1857ms | 0.1172ms | 8.5361 KOps/s | 8.4895 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1071ms | 26.1445μs | 38.2489 KOps/s | 39.1196 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1176ms | 58.6414μs | 17.0528 KOps/s | 16.8583 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1537ms | 77.6401μs | 12.8799 KOps/s | 12.0905 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1464ms | 67.0333μs | 14.9180 KOps/s | 14.6902 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2118ms | 0.1048ms | 9.5416 KOps/s | 9.4536 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4566ms | 0.2156ms | 4.6378 KOps/s | 4.7071 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1172ms | 45.0135μs | 22.2156 KOps/s | 21.2490 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4723ms | 67.7106μs | 14.7687 KOps/s | 15.2002 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1926ms | 0.1028ms | 9.7244 KOps/s | 9.6671 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4572ms | 0.2013ms | 4.9687 KOps/s | 5.0051 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3467ms | 0.2320ms | 4.3101 KOps/s | 4.3184 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2182ms | 0.1058ms | 9.4487 KOps/s | 9.4329 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1367ms | 63.3176μs | 15.7934 KOps/s | 16.3281 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1107ms | 46.6599μs | 21.4317 KOps/s | 21.4726 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5953ms | 0.1590ms | 6.2896 KOps/s | 6.2984 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2056ms | 0.1033ms | 9.6822 KOps/s | 9.7223 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 72.4750μs | 21.4373μs | 46.6477 KOps/s | 47.4646 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1345ms | 65.6880μs | 15.2235 KOps/s | 14.7875 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1571ms | 79.2740μs | 12.6145 KOps/s | 12.6214 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1336ms | 68.1989μs | 14.6630 KOps/s | 14.7334 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4277ms | 0.2147ms | 4.6568 KOps/s | 4.8119 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.0078ms | 1.3274ms | 753.3707 Ops/s | 776.9458 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4094ms | 0.2082ms | 4.8029 KOps/s | 4.8870 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3606ms | 0.7933ms | 1.2605 KOps/s | 1.3099 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.6449ms | 0.4590ms | 2.1784 KOps/s | 2.2389 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.8230ms | 2.7816ms | 359.5053 Ops/s | 388.2824 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1324ms | 38.1098μs | 26.2400 KOps/s | 27.4454 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.7070ms | 34.0577μs | 29.3619 KOps/s | 29.3161 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 98.5430μs | 29.7488μs | 33.6147 KOps/s | 33.4207 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 67.0950μs | 23.0171μs | 43.4460 KOps/s | 42.6776 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1074ms | 30.8910μs | 32.3719 KOps/s | 32.8492 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 79.7180μs | 23.0882μs | 43.3121 KOps/s | 42.7288 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1179ms | 53.1091μs | 18.8292 KOps/s | 19.6910 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5802ms | 20.4890μs | 48.8068 KOps/s | 51.2568 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1295ms | 45.1207μs | 22.1628 KOps/s | 22.6796 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 71.8050μs | 18.7322μs | 53.3840 KOps/s | 54.2986 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1406ms | 45.8062μs | 21.8311 KOps/s | 22.0855 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 88.1040μs | 18.5503μs | 53.9074 KOps/s | 53.6956 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1282ms | 53.4900μs | 18.6951 KOps/s | 19.3177 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9034ms | 20.3666μs | 49.1000 KOps/s | 50.7605 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 95.3180μs | 45.5694μs | 21.9445 KOps/s | 22.3883 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 75.4800μs | 18.4332μs | 54.2499 KOps/s | 54.3743 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1106ms | 45.4656μs | 21.9946 KOps/s | 22.1675 KOps/s | |
test_compile_indexing[int-pytree-eager] | 84.4570μs | 18.5723μs | 53.8437 KOps/s | 53.6164 KOps/s | |
test_mod_add[eager] | 0.1463ms | 35.3621μs | 28.2788 KOps/s | 29.1532 KOps/s | |
test_mod_add[compile] | 0.1019ms | 48.6654μs | 20.5485 KOps/s | 20.6163 KOps/s | |
test_mod_add[compile-overhead] | 0.1255ms | 49.6164μs | 20.1546 KOps/s | 20.2728 KOps/s | |
test_mod_wrap[eager] | 0.3630ms | 0.2275ms | 4.3962 KOps/s | 4.5289 KOps/s | |
test_mod_wrap[compile] | 0.3282ms | 0.2081ms | 4.8047 KOps/s | 4.8960 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4161ms | 0.2104ms | 4.7522 KOps/s | 4.8868 KOps/s | |
test_mod_wrap_and_backward[eager] | 22.7342ms | 12.5257ms | 79.8361 Ops/s | 83.9039 Ops/s | |
test_mod_wrap_and_backward[compile] | 20.4316ms | 14.0616ms | 71.1159 Ops/s | 75.0372 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 15.3572ms | 13.4165ms | 74.5351 Ops/s | 81.8808 Ops/s | |
test_seq_add[eager] | 0.2260ms | 0.1187ms | 8.4225 KOps/s | 8.5666 KOps/s | |
test_seq_add[compile] | 0.1365ms | 64.2160μs | 15.5725 KOps/s | 16.2304 KOps/s | |
test_seq_add[compile-overhead] | 0.1564ms | 63.0999μs | 15.8479 KOps/s | 16.6093 KOps/s | |
test_seq_wrap[eager] | 0.7734ms | 0.4572ms | 2.1875 KOps/s | 2.2591 KOps/s | |
test_seq_wrap[compile] | 0.4466ms | 0.2320ms | 4.3105 KOps/s | 4.3804 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4625ms | 0.2317ms | 4.3165 KOps/s | 4.3827 KOps/s | |
test_func_call_runtime[False-eager] | 0.9631ms | 0.5500ms | 1.8183 KOps/s | 1.8791 KOps/s | |
test_func_call_runtime[False-compile] | 0.5792ms | 0.4299ms | 2.3263 KOps/s | 2.3553 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5592ms | 0.4293ms | 2.3293 KOps/s | 2.3320 KOps/s | |
test_func_call_runtime[True-eager] | 1.0911ms | 0.7618ms | 1.3127 KOps/s | 1.3315 KOps/s | |
test_func_call_runtime[True-compile] | 0.5753ms | 0.4639ms | 2.1557 KOps/s | 2.1295 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5732ms | 0.4660ms | 2.1460 KOps/s | 2.1379 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9208ms | 0.5481ms | 1.8246 KOps/s | 1.8504 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5433ms | 0.4253ms | 2.3515 KOps/s | 2.3385 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5577ms | 0.4249ms | 2.3532 KOps/s | 2.3313 KOps/s | |
test_func_call_cm_runtime[True-eager] | 2.0374ms | 0.9092ms | 1.0999 KOps/s | 1.1062 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8695ms | 0.4946ms | 2.0218 KOps/s | 2.0241 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6255ms | 0.4882ms | 2.0483 KOps/s | 2.0190 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5028ms | 1.9144ms | 522.3602 Ops/s | 515.4130 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9088ms | 0.5198ms | 1.9237 KOps/s | 1.9162 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7182ms | 0.5274ms | 1.8963 KOps/s | 1.9222 KOps/s | |
test_distributed | 0.3233ms | 0.1230ms | 8.1327 KOps/s | 7.8530 KOps/s | |
test_tdmodule | 70.3810μs | 26.8889μs | 37.1900 KOps/s | 38.0870 KOps/s | |
test_tdmodule_dispatch | 76.9530μs | 48.2261μs | 20.7357 KOps/s | 21.4374 KOps/s | |
test_tdseq | 49.1410μs | 28.9327μs | 34.5630 KOps/s | 34.0430 KOps/s | |
test_tdseq_dispatch | 85.1280μs | 53.2894μs | 18.7655 KOps/s | 18.5308 KOps/s | |
test_instantiation_functorch | 2.0329ms | 1.5451ms | 647.2121 Ops/s | 643.4857 Ops/s | |
test_exec_functorch | 0.2770ms | 0.1798ms | 5.5624 KOps/s | 5.5988 KOps/s | |
test_exec_functional_call | 0.3114ms | 0.1753ms | 5.7048 KOps/s | 5.7835 KOps/s | |
test_exec_td_decorator | 0.5039ms | 0.2333ms | 4.2871 KOps/s | 4.2947 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8594ms | 0.6606ms | 1.5139 KOps/s | 1.5475 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0593ms | 0.6587ms | 1.5181 KOps/s | 1.5551 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8599ms | 0.5360ms | 1.8658 KOps/s | 1.9276 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7657ms | 0.5323ms | 1.8786 KOps/s | 1.9271 KOps/s | |
test_to_module_speed[True] | 2.1339ms | 1.3388ms | 746.9354 Ops/s | 743.0005 Ops/s | |
test_to_module_speed[False] | 1.8235ms | 1.2982ms | 770.3113 Ops/s | 761.9770 Ops/s | |
test_tc_init | 84.6970μs | 46.3936μs | 21.5547 KOps/s | 22.3586 KOps/s | |
test_tc_init_nested | 0.1503ms | 92.7745μs | 10.7788 KOps/s | 10.9096 KOps/s | |
test_tc_first_layer_tensor | 19.0150μs | 1.6242μs | 615.6875 KOps/s | 660.0352 KOps/s | |
test_tc_first_layer_nontensor | 32.0600μs | 4.7531μs | 210.3908 KOps/s | 207.4478 KOps/s | |
test_tc_second_layer_tensor | 24.0150μs | 2.9695μs | 336.7581 KOps/s | 350.1884 KOps/s | |
test_tc_second_layer_nontensor | 41.8880μs | 6.1717μs | 162.0305 KOps/s | 163.9613 KOps/s | |
test_unbind | 0.2320s | 14.4359ms | 69.2716 Ops/s | 76.9822 Ops/s | |
test_full_like | 18.8815ms | 13.1647ms | 75.9605 Ops/s | 124.6634 Ops/s | |
test_zeros_like | 13.7611ms | 7.9810ms | 125.2970 Ops/s | 311.3710 Ops/s | |
test_ones_like | 11.6858ms | 8.0307ms | 124.5221 Ops/s | 280.9763 Ops/s | |
test_clone | 14.1646ms | 10.0785ms | 99.2210 Ops/s | 180.7739 Ops/s | |
test_squeeze | 63.3380μs | 12.1812μs | 82.0935 KOps/s | 79.3255 KOps/s | |
test_unsqueeze | 0.2024ms | 90.6855μs | 11.0271 KOps/s | 11.0081 KOps/s | |
test_split | 0.4790ms | 0.1982ms | 5.0460 KOps/s | 5.1560 KOps/s | |
test_permute | 0.2863ms | 0.1932ms | 5.1748 KOps/s | 5.1893 KOps/s | |
test_stack | 30.5786ms | 25.6339ms | 39.0109 Ops/s | 37.5328 Ops/s | |
test_cat | 29.8299ms | 24.9813ms | 40.0300 Ops/s | 37.7729 Ops/s |
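
The Change column above is empty in this capture. As a rough aid for reading the table, the sketch below computes a relative change between the Ops and Ops on Repo HEAD columns for a few rows copied from it. The formula `(PR - HEAD) / HEAD` and the helper names `parse_ops` and `relative_change` are assumptions made for illustration; the page does not say how the CI job derives its Change value.

```python
# Hypothetical helpers (not part of the PR or its CI tooling).
# Assumption: "change" means (PR - HEAD) / HEAD on the throughput columns.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a table cell such as '74.1506 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(ops_pr: str, ops_head: str) -> float:
    """Return (PR - HEAD) / HEAD; a negative value means the PR run was slower."""
    pr, head = parse_ops(ops_pr), parse_ops(ops_head)
    return (pr - head) / head


# (Ops, Ops on Repo HEAD) pairs copied verbatim from the table above.
rows = {
    "test_membership_stacked_nested_last": ("74.1506 KOps/s", "228.7645 KOps/s"),
    "test_zeros_like": ("125.2970 Ops/s", "311.3710 Ops/s"),
    "test_chunk": ("619.7180 Ops/s", "568.6896 Ops/s"),
}

for name, (pr, head) in rows.items():
    print(f"{name}: {relative_change(pr, head):+.1%}")
```

Single-run differences of a few percent are likely within benchmark noise, so only large gaps, such as the two membership and `*_like` rows, are worth a closer look.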

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 32.0200μs | 11.7966μs | 84.7701 KOps/s | 75.0659 KOps/s | |
test_plain_set_stack_nested | 37.7610μs | 12.0285μs | 83.1361 KOps/s | 73.8430 KOps/s | |
test_plain_set_nested_inplace | 43.0110μs | 12.8731μs | 77.6817 KOps/s | 69.3486 KOps/s | |
test_plain_set_stack_nested_inplace | 39.4910μs | 12.8711μs | 77.6937 KOps/s | 70.9745 KOps/s | |
test_items | 33.4900μs | 2.8429μs | 351.7522 KOps/s | 341.1462 KOps/s | |
test_items_nested | 0.4175ms | 0.3639ms | 2.7484 KOps/s | 2.8007 KOps/s | |
test_items_nested_locked | 0.4413ms | 0.3647ms | 2.7419 KOps/s | 2.8221 KOps/s | |
test_items_nested_leaf | 88.7710μs | 58.7197μs | 17.0301 KOps/s | 17.1140 KOps/s | |
test_items_stack_nested | 0.4141ms | 0.3603ms | 2.7751 KOps/s | 2.7724 KOps/s | |
test_items_stack_nested_leaf | 94.2410μs | 60.2529μs | 16.5967 KOps/s | 16.3451 KOps/s | |
test_items_stack_nested_locked | 0.4188ms | 0.3594ms | 2.7823 KOps/s | 2.7794 KOps/s | |
test_keys | 34.8110μs | 3.4338μs | 291.2205 KOps/s | 288.9931 KOps/s | |
test_keys_nested | 0.1302ms | 81.5420μs | 12.2636 KOps/s | 12.1925 KOps/s | |
test_keys_nested_locked | 2.6347ms | 87.9103μs | 11.3752 KOps/s | 11.4023 KOps/s | |
test_keys_nested_leaf | 0.1061ms | 72.3465μs | 13.8224 KOps/s | 13.7853 KOps/s | |
test_keys_stack_nested | 0.1280ms | 82.8037μs | 12.0768 KOps/s | 12.0016 KOps/s | |
test_keys_stack_nested_leaf | 0.1485ms | 72.7071μs | 13.7538 KOps/s | 13.5332 KOps/s | |
test_keys_stack_nested_locked | 0.1292ms | 88.2155μs | 11.3359 KOps/s | 11.2478 KOps/s | |
test_values | 7.4602μs | 0.8474μs | 1.1800 MOps/s | 1.1849 MOps/s | |
test_values_nested | 79.1420μs | 34.4084μs | 29.0627 KOps/s | 29.1286 KOps/s | |
test_values_nested_locked | 65.2810μs | 36.3887μs | 27.4811 KOps/s | 27.7020 KOps/s | |
test_values_nested_leaf | 66.9810μs | 39.2058μs | 25.5064 KOps/s | 25.2994 KOps/s | |
test_values_stack_nested | 72.9010μs | 34.5581μs | 28.9367 KOps/s | 28.6493 KOps/s | |
test_values_stack_nested_leaf | 64.6920μs | 39.7803μs | 25.1381 KOps/s | 24.9956 KOps/s | |
test_values_stack_nested_locked | 78.0220μs | 36.9530μs | 27.0614 KOps/s | 27.0381 KOps/s | |
test_membership | 1.6695μs | 0.4995μs | 2.0022 MOps/s | 1.9430 MOps/s | |
test_membership_nested | 26.8100μs | 2.0687μs | 483.3937 KOps/s | 508.4304 KOps/s | |
test_membership_nested_leaf | 20.1505μs | 1.9947μs | 501.3262 KOps/s | 509.3883 KOps/s | |
test_membership_stacked_nested | 36.2910μs | 2.0812μs | 480.4924 KOps/s | 492.5789 KOps/s | |
test_membership_stacked_nested_leaf | 32.4210μs | 2.0557μs | 486.4630 KOps/s | 488.4536 KOps/s | |
test_membership_nested_last | 46.9610μs | 3.0715μs | 325.5693 KOps/s | 328.5118 KOps/s | |
test_membership_nested_leaf_last | 38.2710μs | 3.1051μs | 322.0468 KOps/s | 325.3612 KOps/s | |
test_membership_stacked_nested_last | 35.7500μs | 3.5735μs | 279.8414 KOps/s | 281.7871 KOps/s | |
test_membership_stacked_nested_leaf_last | 43.6410μs | 3.5232μs | 283.8343 KOps/s | 282.1248 KOps/s | |
test_nested_getleaf | 32.3100μs | 6.0630μs | 164.9348 KOps/s | 162.5365 KOps/s | |
test_nested_get | 35.4710μs | 5.7492μs | 173.9369 KOps/s | 171.4239 KOps/s | |
test_stacked_getleaf | 27.4000μs | 6.0778μs | 164.5345 KOps/s | 162.5118 KOps/s | |
test_stacked_get | 51.3310μs | 5.8541μs | 170.8210 KOps/s | 172.3957 KOps/s | |
test_nested_getitemleaf | 36.5900μs | 6.1629μs | 162.2603 KOps/s | 160.3481 KOps/s | |
test_nested_getitem | 45.9210μs | 5.9311μs | 168.6023 KOps/s | 169.9301 KOps/s | |
test_stacked_getitemleaf | 35.8810μs | 6.1811μs | 161.7825 KOps/s | 162.2352 KOps/s | |
test_stacked_getitem | 43.2300μs | 5.8643μs | 170.5228 KOps/s | 167.9578 KOps/s | |
test_lock_nested | 4.4931ms | 0.3729ms | 2.6819 KOps/s | 2.6236 KOps/s | |
test_lock_stack_nested | 0.4045ms | 0.3424ms | 2.9202 KOps/s | 2.8355 KOps/s | |
test_unlock_nested | 0.7509ms | 0.3115ms | 3.2100 KOps/s | 3.0922 KOps/s | |
test_unlock_stack_nested | 0.3198ms | 0.2803ms | 3.5679 KOps/s | 3.4430 KOps/s | |
test_flatten_speed | 0.1071ms | 74.8190μs | 13.3656 KOps/s | 13.0720 KOps/s | |
test_unflatten_speed | 0.3928ms | 0.3239ms | 3.0875 KOps/s | 3.0791 KOps/s | |
test_common_ops | 1.6388ms | 0.5903ms | 1.6941 KOps/s | 1.5261 KOps/s | |
test_creation | 0.1867ms | 1.7469μs | 572.4531 KOps/s | 584.5570 KOps/s | |
test_creation_empty | 35.6200μs | 7.2909μs | 137.1569 KOps/s | 97.6263 KOps/s | |
test_creation_nested_1 | 46.3800μs | 8.9783μs | 111.3791 KOps/s | 84.0858 KOps/s | |
test_creation_nested_2 | 36.0200μs | 11.6623μs | 85.7465 KOps/s | 68.3968 KOps/s | |
test_clone | 0.1260ms | 10.4319μs | 95.8600 KOps/s | 88.2050 KOps/s | |
test_getitem[int] | 72.4510μs | 10.4837μs | 95.3861 KOps/s | 93.2847 KOps/s | |
test_getitem[slice_int] | 1.1669ms | 20.8444μs | 47.9746 KOps/s | 47.0398 KOps/s | |
test_getitem[range] | 0.1283ms | 36.6675μs | 27.2721 KOps/s | 26.1615 KOps/s | |
test_getitem[tuple] | 0.1113ms | 18.2118μs | 54.9095 KOps/s | 53.7871 KOps/s | |
test_getitem[list] | 0.1262ms | 31.6957μs | 31.5500 KOps/s | 29.6013 KOps/s | |
test_setitem_dim[int] | 37.4810μs | 18.0178μs | 55.5006 KOps/s | 51.8179 KOps/s | |
test_setitem_dim[slice_int] | 65.6610μs | 37.8281μs | 26.4354 KOps/s | 25.6835 KOps/s | |
test_setitem_dim[range] | 72.6110μs | 50.3261μs | 19.8704 KOps/s | 18.8356 KOps/s | |
test_setitem_dim[tuple] | 60.3810μs | 31.0971μs | 32.1574 KOps/s | 30.4546 KOps/s | |
test_setitem | 0.1145ms | 14.4893μs | 69.0162 KOps/s | 58.8688 KOps/s | |
test_set | 0.1291ms | 14.2249μs | 70.2991 KOps/s | 59.9512 KOps/s | |
test_set_shared | 1.5047ms | 0.1511ms | 6.6163 KOps/s | 6.5235 KOps/s | |
test_update | 0.3056ms | 16.2118μs | 61.6834 KOps/s | 48.9977 KOps/s | |
test_update_nested | 63.5810μs | 21.9212μs | 45.6179 KOps/s | 38.2943 KOps/s | |
test_update__nested | 0.4426ms | 26.3620μs | 37.9334 KOps/s | 36.7790 KOps/s | |
test_set_nested | 0.1096ms | 16.4001μs | 60.9753 KOps/s | 56.5433 KOps/s | |
test_set_nested_new | 0.1114ms | 18.1625μs | 55.0585 KOps/s | 49.8896 KOps/s | |
test_select | 0.1235ms | 28.9487μs | 34.5439 KOps/s | 31.1269 KOps/s | |
test_select_nested | 73.6710μs | 44.5303μs | 22.4566 KOps/s | 22.9048 KOps/s | |
test_exclude_nested | 96.0310μs | 63.6664μs | 15.7069 KOps/s | 15.8586 KOps/s | |
test_empty[True] | 0.3714ms | 0.2895ms | 3.4542 KOps/s | 3.4880 KOps/s | |
test_empty[False] | 3.3311μs | 0.8281μs | 1.2075 MOps/s | 1.2091 MOps/s | |
test_to | 84.9020μs | 55.1585μs | 18.1296 KOps/s | 17.3097 KOps/s | |
test_to_nonblocking | 95.8220μs | 47.4503μs | 21.0747 KOps/s | 20.5877 KOps/s | |
test_unbind_speed | 1.7349ms | 0.2361ms | 4.2349 KOps/s | 4.1678 KOps/s | |
test_unbind_speed_stack0 | 0.3102ms | 0.2356ms | 4.2449 KOps/s | 4.0897 KOps/s | |
test_unbind_speed_stack1 | 92.8162ms | 0.6645ms | 1.5050 KOps/s | 1.4772 KOps/s | |
test_split | 93.4323ms | 1.5627ms | 639.9080 Ops/s | 622.5076 Ops/s | |
test_chunk | 95.7041ms | 1.5751ms | 634.8895 Ops/s | 620.6517 Ops/s | |
test_consolidate[False-None] | 96.5292ms | 2.9073ms | 343.9607 Ops/s | 343.4048 Ops/s | |
test_consolidate[default-None] | 1.8019ms | 1.6875ms | 592.5809 Ops/s | 600.0391 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8656ms | 1.7254ms | 579.5636 Ops/s | 590.1325 Ops/s | |
test_consolidate_njt[False-None] | 6.9741ms | 6.5857ms | 151.8438 Ops/s | 156.3867 Ops/s | |
test_to[False-False-None] | 1.7666ms | 1.6968ms | 589.3332 Ops/s | 584.8969 Ops/s | |
test_to[True-False-None] | 1.5301ms | 1.2928ms | 773.5433 Ops/s | 762.3038 Ops/s | |
test_to[within-False-None] | 4.3253ms | 4.0499ms | 246.9171 Ops/s | 241.6282 Ops/s | |
test_to[True-default-None] | 5.4370ms | 5.1917ms | 192.6163 Ops/s | 192.9919 Ops/s | |
test_to_njt[False-False-None] | 6.9437ms | 6.8309ms | 146.3930 Ops/s | 146.6726 Ops/s | |
test_to_njt[True-False-None] | 5.9479ms | 5.4157ms | 184.6490 Ops/s | 181.8270 Ops/s | |
test_to_njt[within-False-None] | 12.2815ms | 12.0208ms | 83.1894 Ops/s | 82.5779 Ops/s | |
test_creation[device0] | 0.6385ms | 80.7020μs | 12.3913 KOps/s | 12.1392 KOps/s | |
test_creation_from_tensor | 0.5003ms | 83.4912μs | 11.9773 KOps/s | 12.0013 KOps/s | |
test_add_one[memmap_tensor0] | 0.4292ms | 6.4740μs | 154.4645 KOps/s | 143.0150 KOps/s | |
test_contiguous[memmap_tensor0] | 2.4580μs | 0.4031μs | 2.4810 MOps/s | 2.3757 MOps/s | |
test_stack[memmap_tensor0] | 40.6210μs | 4.3221μs | 231.3710 KOps/s | 225.2567 KOps/s | |
test_memmaptd_index | 2.0382ms | 0.2467ms | 4.0541 KOps/s | 3.9673 KOps/s | |
test_memmaptd_index_astensor | 0.9516ms | 0.3065ms | 3.2627 KOps/s | 3.1875 KOps/s | |
test_memmaptd_index_op | 0.9960ms | 0.5523ms | 1.8106 KOps/s | 1.5912 KOps/s | |
test_serialize_model | 0.1325s | 0.1317s | 7.5932 Ops/s | 7.6085 Ops/s | |
test_serialize_model_pickle | 1.3591s | 1.2160s | 0.8224 Ops/s | 0.8248 Ops/s | |
test_serialize_weights | 0.1307s | 0.1298s | 7.7024 Ops/s | 7.6750 Ops/s | |
test_serialize_weights_returnearly | 0.3284s | 53.2505ms | 18.7792 Ops/s | 14.3516 Ops/s | |
test_serialize_weights_pickle | 1.3766s | 1.2168s | 0.8218 Ops/s | 0.8234 Ops/s | |
test_reshape_pytree | 65.3510μs | 21.7350μs | 46.0087 KOps/s | 42.9900 KOps/s | |
test_reshape_td | 57.2710μs | 26.4972μs | 37.7399 KOps/s | 36.1774 KOps/s | |
test_view_pytree | 51.4110μs | 21.6890μs | 46.1063 KOps/s | 44.7535 KOps/s | |
test_view_td | 64.5320μs | 29.7061μs | 33.6631 KOps/s | 30.9914 KOps/s | |
test_unbind_pytree | 68.9120μs | 27.7236μs | 36.0704 KOps/s | 35.4405 KOps/s | |
test_unbind_td | 0.5786ms | 35.9478μs | 27.8181 KOps/s | 26.7701 KOps/s | |
test_split_pytree | 61.5910μs | 29.2164μs | 34.2274 KOps/s | 33.3392 KOps/s | |
test_split_td | 0.7434ms | 37.5755μs | 26.6131 KOps/s | 25.2341 KOps/s | |
test_add_pytree | 61.4220μs | 33.4491μs | 29.8962 KOps/s | 28.0275 KOps/s | |
test_add_td | 96.0210μs | 47.2575μs | 21.1607 KOps/s | 18.1022 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1719ms | 0.1195ms | 8.3655 KOps/s | 8.1092 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2279ms | 0.1321ms | 7.5675 KOps/s | 7.3289 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1942ms | 97.1268μs | 10.2958 KOps/s | 10.2663 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3633ms | 0.1487ms | 6.7255 KOps/s | 6.4380 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 66.5210μs | 23.1496μs | 43.1973 KOps/s | 43.4256 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1018ms | 29.4728μs | 33.9296 KOps/s | 33.7882 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1055ms | 65.0206μs | 15.3797 KOps/s | 15.2035 KOps/s | |
test_compile_copy_nested[pytree-eager] | 83.2910μs | 49.0406μs | 20.3913 KOps/s | 19.9989 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1822ms | 0.1420ms | 7.0403 KOps/s | 7.0647 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3187ms | 0.2171ms | 4.6070 KOps/s | 4.6524 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1371ms | 98.9602μs | 10.1051 KOps/s | 9.8751 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1124ms | 54.6572μs | 18.2959 KOps/s | 17.9259 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1735ms | 0.1350ms | 7.4048 KOps/s | 7.3526 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5475ms | 0.4818ms | 2.0757 KOps/s | 1.9742 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3837ms | 0.2602ms | 3.8436 KOps/s | 3.8179 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2293ms | 0.1442ms | 6.9352 KOps/s | 6.9785 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1657ms | 65.7095μs | 15.2185 KOps/s | 14.4351 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1768ms | 99.6340μs | 10.0367 KOps/s | 10.2186 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4528ms | 0.4062ms | 2.4618 KOps/s | 2.4215 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1773ms | 0.1357ms | 7.3687 KOps/s | 7.4643 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 54.5010μs | 18.7877μs | 53.2264 KOps/s | 55.3228 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 59.4810μs | 31.2555μs | 31.9944 KOps/s | 31.7527 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1746ms | 69.4981μs | 14.3889 KOps/s | 14.2975 KOps/s | |
test_compile_copy_flat[pytree-eager] | 74.8310μs | 51.1521μs | 19.5496 KOps/s | 19.1530 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6514ms | 0.3952ms | 2.5306 KOps/s | 2.2094 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.6996ms | 2.5643ms | 389.9701 Ops/s | 376.3144 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5592ms | 0.4250ms | 2.3529 KOps/s | 2.2839 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7214ms | 2.6123ms | 382.8091 Ops/s | 373.7672 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5201ms | 0.1158ms | 8.6387 KOps/s | 8.4568 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5686ms | 77.6105μs | 12.8848 KOps/s | 11.8335 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.4405ms | 0.1076ms | 9.2908 KOps/s | 9.0625 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1097ms | 68.0457μs | 14.6960 KOps/s | 14.1117 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1582ms | 0.1080ms | 9.2595 KOps/s | 9.0284 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1428ms | 71.3303μs | 14.0193 KOps/s | 13.9844 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1454ms | 0.1004ms | 9.9581 KOps/s | 9.6243 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1419ms | 17.9505μs | 55.7087 KOps/s | 57.3932 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1368ms | 94.6133μs | 10.5693 KOps/s | 10.3344 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 46.2810μs | 15.7461μs | 63.5077 KOps/s | 60.0322 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1431ms | 94.4123μs | 10.5918 KOps/s | 10.0584 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 45.1800μs | 15.6225μs | 64.0104 KOps/s | 62.7336 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1665ms | 99.1223μs | 10.0885 KOps/s | 9.5418 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5840ms | 17.3788μs | 57.5415 KOps/s | 58.5034 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1492ms | 97.5183μs | 10.2545 KOps/s | 9.9986 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1653ms | 15.6211μs | 64.0161 KOps/s | 62.3401 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1388ms | 96.5968μs | 10.3523 KOps/s | 9.9823 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1182ms | 17.7892μs | 56.2138 KOps/s | 63.5156 KOps/s | |
test_mod_add[eager] | 78.6910μs | 36.5454μs | 27.3633 KOps/s | 24.1129 KOps/s | |
test_mod_add[compile] | 0.1315ms | 78.4092μs | 12.7536 KOps/s | 12.4465 KOps/s | |
test_mod_add[compile-overhead] | 0.3313ms | 0.1747ms | 5.7225 KOps/s | 5.6843 KOps/s | |
test_mod_wrap[eager] | 0.3186ms | 0.2439ms | 4.1008 KOps/s | 3.9717 KOps/s | |
test_mod_wrap[compile] | 0.3484ms | 0.2875ms | 3.4779 KOps/s | 3.5379 KOps/s | |
test_mod_wrap[compile-overhead] | 7.0850ms | 3.7695ms | 265.2882 Ops/s | 278.7667 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4821ms | 1.3438ms | 744.1623 Ops/s | 687.4526 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4024ms | 1.2576ms | 795.1784 Ops/s | 732.2027 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3885ms | 0.9303ms | 1.0749 KOps/s | 964.1262 Ops/s | |
test_seq_add[eager] | 0.5227ms | 0.1149ms | 8.7048 KOps/s | 8.2191 KOps/s | |
test_seq_add[compile] | 0.4855ms | 87.7108μs | 11.4011 KOps/s | 11.4089 KOps/s | |
test_seq_add[compile-overhead] | 0.1824ms | 0.1292ms | 7.7408 KOps/s | 7.3493 KOps/s | |
test_seq_wrap[eager] | 0.8440ms | 0.4111ms | 2.4325 KOps/s | 2.2947 KOps/s | |
test_seq_wrap[compile] | 0.7152ms | 0.2930ms | 3.4129 KOps/s | 3.2977 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2764ms | 0.2235ms | 4.4741 KOps/s | 4.4243 KOps/s | |
test_func_call_runtime[False-eager] | 1.2205ms | 0.7446ms | 1.3431 KOps/s | 1.3559 KOps/s | |
test_func_call_runtime[False-compile] | 1.1619ms | 0.7305ms | 1.3689 KOps/s | 1.3084 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8220ms | 0.3614ms | 2.7673 KOps/s | 2.7620 KOps/s | |
test_func_call_runtime[True-eager] | 1.3205ms | 0.8900ms | 1.1236 KOps/s | 1.1081 KOps/s | |
test_func_call_runtime[True-compile] | 1.1848ms | 0.7511ms | 1.3314 KOps/s | 1.3149 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5165ms | 0.3792ms | 2.6371 KOps/s | 2.6048 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8330ms | 0.7192ms | 1.3904 KOps/s | 1.3541 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8946ms | 0.7319ms | 1.3662 KOps/s | 1.3410 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4374ms | 0.3605ms | 2.7743 KOps/s | 2.7391 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1379ms | 0.9899ms | 1.0102 KOps/s | 984.7168 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9213ms | 0.7746ms | 1.2910 KOps/s | 1.2505 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4832ms | 0.4069ms | 2.4576 KOps/s | 2.4324 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5443ms | 2.0746ms | 482.0159 Ops/s | 478.7462 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.2398ms | 0.7902ms | 1.2656 KOps/s | 1.2409 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8395ms | 0.4097ms | 2.4407 KOps/s | 2.4207 KOps/s | |
test_distributed | 4.1250ms | 0.3156ms | 3.1690 KOps/s | 8.4237 KOps/s | |
test_tdmodule | 52.2110μs | 19.2327μs | 51.9949 KOps/s | 46.1873 KOps/s | |
test_tdmodule_dispatch | 78.6810μs | 34.8571μs | 28.6885 KOps/s | 26.1000 KOps/s | |
test_tdseq | 50.6100μs | 20.2736μs | 49.3253 KOps/s | 45.1637 KOps/s | |
test_tdseq_dispatch | 68.4210μs | 37.3348μs | 26.7847 KOps/s | 24.7387 KOps/s | |
test_instantiation_functorch | 1.7576ms | 1.5603ms | 640.9050 Ops/s | 635.8051 Ops/s | |
test_exec_functorch | 0.1877ms | 0.1424ms | 7.0235 KOps/s | 6.7321 KOps/s | |
test_exec_functional_call | 0.1708ms | 0.1336ms | 7.4860 KOps/s | 7.1473 KOps/s | |
test_exec_td_decorator | 0.3684ms | 0.1808ms | 5.5307 KOps/s | 5.2908 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7866ms | 0.6937ms | 1.4414 KOps/s | 1.4519 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8474ms | 0.6851ms | 1.4595 KOps/s | 1.4464 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7044ms | 0.5932ms | 1.6857 KOps/s | 1.6748 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7099ms | 0.5944ms | 1.6824 KOps/s | 1.6779 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.1699ms | 19.0453ms | 52.5063 Ops/s | 52.2652 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.7993ms | 19.0641ms | 52.4547 Ops/s | 52.3178 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.1539ms | 19.0008ms | 52.6295 Ops/s | 52.8868 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.7736ms | 18.9958ms | 52.6431 Ops/s | 52.8790 Ops/s | |
test_to_module_speed[True] | 1.1615ms | 0.9763ms | 1.0243 KOps/s | 1.0253 KOps/s | |
test_to_module_speed[False] | 1.0647ms | 0.9566ms | 1.0454 KOps/s | 1.0389 KOps/s | |
test_tc_init | 69.8110μs | 35.8312μs | 27.9086 KOps/s | 25.7467 KOps/s | |
test_tc_init_nested | 0.2026ms | 70.8514μs | 14.1140 KOps/s | 13.2924 KOps/s | |
test_tc_first_layer_tensor | 6.1586μs | 0.6796μs | 1.4715 MOps/s | 1.4010 MOps/s | |
test_tc_first_layer_nontensor | 26.8810μs | 2.2428μs | 445.8740 KOps/s | 442.1256 KOps/s | |
test_tc_second_layer_tensor | 11.4203μs | 1.4383μs | 695.2615 KOps/s | 703.1631 KOps/s | |
test_tc_second_layer_nontensor | 31.5710μs | 2.9949μs | 333.8968 KOps/s | 330.5956 KOps/s | |
test_unbind | 0.2220s | 11.6128ms | 86.1119 Ops/s | 140.4174 Ops/s | |
test_full_like | 9.2787ms | 9.0471ms | 110.5325 Ops/s | 110.0388 Ops/s | |
test_zeros_like | 5.4230ms | 4.3145ms | 231.7750 Ops/s | 237.7590 Ops/s | |
test_ones_like | 4.9830ms | 4.2679ms | 234.3080 Ops/s | 231.3949 Ops/s | |
test_clone | 6.6128ms | 6.2884ms | 159.0228 Ops/s | 110.9145 Ops/s | |
test_squeeze | 92.7520μs | 9.5601μs | 104.6016 KOps/s | 100.1645 KOps/s | |
test_unsqueeze | 0.1232ms | 70.5027μs | 14.1839 KOps/s | 14.0340 KOps/s | |
test_split | 0.2920ms | 0.1570ms | 6.3705 KOps/s | 6.0278 KOps/s | |
test_permute | 0.2487ms | 0.1736ms | 5.7592 KOps/s | 5.4819 KOps/s | |
test_stack | 50.4990ms | 50.1713ms | 19.9317 Ops/s | 19.8444 Ops/s | |
test_cat | 50.4541ms | 50.1047ms | 19.9582 Ops/s | 19.9433 Ops/s |
vmoens added a commit that referenced this pull request Jan 9, 2025
ghstack-source-id: 3dbefb4f5322a944664bbc2d29af7f862cb92342
Pull Request resolved: #1169
Labels: CLA Signed, enhancement (New feature or request)
Stack from ghstack (oldest at bottom):
__torch_function__ #1169