[BE] TensorClass stub method check #1174
Merged
+115
−1
facebook-github-bot added the CLA Signed label on Jan 9, 2025
This was referenced Jan 9, 2025

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 47.4380μs | 21.4015μs | 46.7257 KOps/s | 49.6161 KOps/s | |
test_plain_set_stack_nested | 77.4450μs | 21.4491μs | 46.6220 KOps/s | 49.3497 KOps/s | |
test_plain_set_nested_inplace | 76.6230μs | 23.7782μs | 42.0553 KOps/s | 45.7764 KOps/s | |
test_plain_set_stack_nested_inplace | 56.9760μs | 23.6154μs | 42.3453 KOps/s | 45.8826 KOps/s | |
test_items | 39.2730μs | 4.1966μs | 238.2887 KOps/s | 239.2115 KOps/s | |
test_items_nested | 0.5904ms | 0.4052ms | 2.4679 KOps/s | 2.5362 KOps/s | |
test_items_nested_locked | 0.8351ms | 0.4095ms | 2.4421 KOps/s | 2.5215 KOps/s | |
test_items_nested_leaf | 0.1474ms | 78.0380μs | 12.8143 KOps/s | 13.0584 KOps/s | |
test_items_stack_nested | 0.7099ms | 0.4129ms | 2.4219 KOps/s | 2.5225 KOps/s | |
test_items_stack_nested_leaf | 0.1364ms | 79.1647μs | 12.6319 KOps/s | 12.6807 KOps/s | |
test_items_stack_nested_locked | 0.5644ms | 0.4091ms | 2.4441 KOps/s | 2.5108 KOps/s | |
test_keys | 40.2650μs | 3.4845μs | 286.9869 KOps/s | 276.1071 KOps/s | |
test_keys_nested | 0.2612ms | 0.1652ms | 6.0543 KOps/s | 6.1629 KOps/s | |
test_keys_nested_locked | 0.7214ms | 0.1707ms | 5.8585 KOps/s | 5.9392 KOps/s | |
test_keys_nested_leaf | 0.2292ms | 0.1443ms | 6.9283 KOps/s | 7.0651 KOps/s | |
test_keys_stack_nested | 0.3000ms | 0.1641ms | 6.0937 KOps/s | 6.2169 KOps/s | |
test_keys_stack_nested_leaf | 0.2332ms | 0.1437ms | 6.9577 KOps/s | 7.2293 KOps/s | |
test_keys_stack_nested_locked | 0.2786ms | 0.1707ms | 5.8579 KOps/s | 6.0485 KOps/s | |
test_values | 7.8968μs | 1.0379μs | 963.4824 KOps/s | 969.2192 KOps/s | |
test_values_nested | 0.1179ms | 63.2223μs | 15.8172 KOps/s | 16.2925 KOps/s | |
test_values_nested_locked | 0.1084ms | 62.1358μs | 16.0938 KOps/s | 16.4893 KOps/s | |
test_values_nested_leaf | 0.1244ms | 71.6901μs | 13.9489 KOps/s | 13.9757 KOps/s | |
test_values_stack_nested | 0.1342ms | 63.2297μs | 15.8154 KOps/s | 15.9227 KOps/s | |
test_values_stack_nested_leaf | 0.1257ms | 72.3124μs | 13.8289 KOps/s | 13.7696 KOps/s | |
test_values_stack_nested_locked | 0.1139ms | 63.4596μs | 15.7581 KOps/s | 16.1076 KOps/s | |
test_membership | 5.3284μs | 0.7238μs | 1.3816 MOps/s | 1.1438 MOps/s | |
test_membership_nested | 25.2980μs | 2.8939μs | 345.5592 KOps/s | 344.8477 KOps/s | |
test_membership_nested_leaf | 23.3540μs | 2.9157μs | 342.9736 KOps/s | 347.0252 KOps/s | |
test_membership_stacked_nested | 21.3590μs | 2.9083μs | 343.8396 KOps/s | 349.5161 KOps/s | |
test_membership_stacked_nested_leaf | 30.0160μs | 2.8790μs | 347.3447 KOps/s | 345.7303 KOps/s | |
test_membership_nested_last | 32.7520μs | 4.4048μs | 227.0270 KOps/s | 233.3854 KOps/s | |
test_membership_nested_leaf_last | 23.5730μs | 4.4248μs | 226.0006 KOps/s | 231.4922 KOps/s | |
test_membership_stacked_nested_last | 30.1560μs | 4.3450μs | 230.1516 KOps/s | 228.0018 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.5180μs | 4.3697μs | 228.8478 KOps/s | 229.7119 KOps/s | |
test_nested_getleaf | 42.4290μs | 10.7394μs | 93.1155 KOps/s | 94.4103 KOps/s | |
test_nested_get | 31.9390μs | 10.3129μs | 96.9655 KOps/s | 101.2444 KOps/s | |
test_stacked_getleaf | 36.8480μs | 10.8499μs | 92.1671 KOps/s | 94.1695 KOps/s | |
test_stacked_get | 33.5430μs | 10.3512μs | 96.6068 KOps/s | 99.7181 KOps/s | |
test_nested_getitemleaf | 36.0880μs | 11.1735μs | 89.4971 KOps/s | 90.0287 KOps/s | |
test_nested_getitem | 32.7410μs | 10.3895μs | 96.2508 KOps/s | 96.7498 KOps/s | |
test_stacked_getitemleaf | 33.3120μs | 11.3044μs | 88.4614 KOps/s | 89.0186 KOps/s | |
test_stacked_getitem | 38.1910μs | 10.5077μs | 95.1679 KOps/s | 96.8100 KOps/s | |
test_lock_nested | 0.7972ms | 0.4577ms | 2.1849 KOps/s | 1.8570 KOps/s | |
test_lock_stack_nested | 0.7115ms | 0.4312ms | 2.3189 KOps/s | 2.4175 KOps/s | |
test_unlock_nested | 0.6898ms | 0.3732ms | 2.6796 KOps/s | 2.7290 KOps/s | |
test_unlock_stack_nested | 0.5968ms | 0.3484ms | 2.8704 KOps/s | 3.0011 KOps/s | |
test_flatten_speed | 0.2132ms | 0.1018ms | 9.8271 KOps/s | 10.1516 KOps/s | |
test_unflatten_speed | 0.9084ms | 0.5218ms | 1.9166 KOps/s | 1.9504 KOps/s | |
test_common_ops | 1.7442ms | 0.8287ms | 1.2067 KOps/s | 1.3157 KOps/s | |
test_creation | 19.8380μs | 2.5150μs | 397.6207 KOps/s | 403.8572 KOps/s | |
test_creation_empty | 37.8710μs | 13.4495μs | 74.3521 KOps/s | 91.1421 KOps/s | |
test_creation_nested_1 | 48.1100μs | 16.5407μs | 60.4568 KOps/s | 72.9855 KOps/s | |
test_creation_nested_2 | 61.2950μs | 21.4645μs | 46.5885 KOps/s | 54.7369 KOps/s | |
test_clone | 1.3768ms | 13.8550μs | 72.1759 KOps/s | 76.5450 KOps/s | |
test_getitem[int] | 0.8208ms | 13.2924μs | 75.2308 KOps/s | 78.1577 KOps/s | |
test_getitem[slice_int] | 0.1369ms | 25.1817μs | 39.7113 KOps/s | 40.2336 KOps/s | |
test_getitem[range] | 0.1765ms | 48.6599μs | 20.5508 KOps/s | 21.5294 KOps/s | |
test_getitem[tuple] | 0.1305ms | 21.2986μs | 46.9515 KOps/s | 48.9440 KOps/s | |
test_getitem[list] | 0.1816ms | 43.2079μs | 23.1439 KOps/s | 23.8982 KOps/s | |
test_setitem_dim[int] | 44.8040μs | 24.5265μs | 40.7722 KOps/s | 40.6030 KOps/s | |
test_setitem_dim[slice_int] | 96.4700μs | 51.2435μs | 19.5147 KOps/s | 19.6723 KOps/s | |
test_setitem_dim[range] | 0.1202ms | 72.1405μs | 13.8618 KOps/s | 13.9816 KOps/s | |
test_setitem_dim[tuple] | 75.6010μs | 39.6593μs | 25.2148 KOps/s | 25.0529 KOps/s | |
test_setitem | 74.3790μs | 22.1176μs | 45.2129 KOps/s | 52.0133 KOps/s | |
test_set | 75.0000μs | 21.4374μs | 46.6474 KOps/s | 53.0557 KOps/s | |
test_set_shared | 1.2790ms | 0.1691ms | 5.9121 KOps/s | 5.9609 KOps/s | |
test_update | 0.1276ms | 24.3884μs | 41.0031 KOps/s | 46.7085 KOps/s | |
test_update_nested | 1.0652ms | 34.3158μs | 29.1411 KOps/s | 31.9608 KOps/s | |
test_update__nested | 88.3950μs | 34.4645μs | 29.0154 KOps/s | 29.8554 KOps/s | |
test_set_nested | 77.5550μs | 23.5497μs | 42.4634 KOps/s | 48.4122 KOps/s | |
test_set_nested_new | 88.2050μs | 29.2143μs | 34.2298 KOps/s | 40.2276 KOps/s | |
test_select | 0.1017ms | 45.3424μs | 22.0544 KOps/s | 24.2813 KOps/s | |
test_select_nested | 0.1437ms | 66.3813μs | 15.0645 KOps/s | 15.9444 KOps/s | |
test_exclude_nested | 0.1740ms | 84.7332μs | 11.8018 KOps/s | 12.3385 KOps/s | |
test_empty[True] | 0.5505ms | 0.4171ms | 2.3977 KOps/s | 2.4762 KOps/s | |
test_empty[False] | 6.8352μs | 1.3574μs | 736.7214 KOps/s | 727.9972 KOps/s | |
test_unbind_speed | 0.3511ms | 0.2711ms | 3.6881 KOps/s | 3.7980 KOps/s | |
test_unbind_speed_stack0 | 0.3751ms | 0.2688ms | 3.7198 KOps/s | 3.8787 KOps/s | |
test_unbind_speed_stack1 | 94.1638ms | 0.7899ms | 1.2659 KOps/s | 1.4189 KOps/s | |
test_split | 97.6400ms | 1.7846ms | 560.3523 Ops/s | 562.3446 Ops/s | |
test_chunk | 97.4766ms | 1.8043ms | 554.2194 Ops/s | 557.3825 Ops/s | |
test_consolidate_njt[False-None] | 11.3821ms | 8.2833ms | 120.7255 Ops/s | 123.8495 Ops/s | |
test_creation[device0] | 0.2525ms | 91.8676μs | 10.8852 KOps/s | 11.1122 KOps/s | |
test_creation_from_tensor | 3.2993ms | 96.0301μs | 10.4134 KOps/s | 10.6661 KOps/s | |
test_add_one[memmap_tensor0] | 0.1633ms | 4.8550μs | 205.9752 KOps/s | 207.5331 KOps/s | |
test_contiguous[memmap_tensor0] | 12.4230μs | 0.5163μs | 1.9369 MOps/s | 1.9335 MOps/s | |
test_stack[memmap_tensor0] | 39.4040μs | 3.3067μs | 302.4184 KOps/s | 289.4768 KOps/s | |
test_memmaptd_index | 0.9164ms | 0.2400ms | 4.1667 KOps/s | 4.1736 KOps/s | |
test_memmaptd_index_astensor | 0.6618ms | 0.3273ms | 3.0556 KOps/s | 3.0397 KOps/s | |
test_memmaptd_index_op | 0.9548ms | 0.6126ms | 1.6324 KOps/s | 1.7041 KOps/s | |
test_serialize_model | 0.1205s | 0.1126s | 8.8843 Ops/s | 7.6311 Ops/s | |
test_serialize_model_pickle | 0.4554s | 0.3881s | 2.5767 Ops/s | 2.5488 Ops/s | |
test_serialize_weights | 0.1209s | 0.1142s | 8.7576 Ops/s | 8.8320 Ops/s | |
test_serialize_weights_returnearly | 0.2571s | 0.1774s | 5.6364 Ops/s | 6.3967 Ops/s | |
test_serialize_weights_pickle | 0.5357s | 0.4282s | 2.3355 Ops/s | 2.4377 Ops/s | |
test_serialize_weights_filesystem | 0.1527s | 0.1417s | 7.0550 Ops/s | 6.8480 Ops/s | |
test_serialize_model_filesystem | 0.1538s | 0.1449s | 6.9020 Ops/s | 6.6680 Ops/s | |
test_reshape_pytree | 68.4380μs | 26.9929μs | 37.0468 KOps/s | 37.8559 KOps/s | |
test_reshape_td | 75.8610μs | 33.4430μs | 29.9016 KOps/s | 30.5630 KOps/s | |
test_view_pytree | 68.5500μs | 27.0820μs | 36.9249 KOps/s | 38.2888 KOps/s | |
test_view_td | 74.1280μs | 38.0696μs | 26.2677 KOps/s | 27.1121 KOps/s | |
test_unbind_pytree | 65.9730μs | 30.3847μs | 32.9113 KOps/s | 33.9499 KOps/s | |
test_unbind_td | 0.3045ms | 39.8671μs | 25.0834 KOps/s | 25.5407 KOps/s | |
test_split_pytree | 69.2500μs | 30.7050μs | 32.5680 KOps/s | 34.0224 KOps/s | |
test_split_td | 0.4692ms | 46.0266μs | 21.7266 KOps/s | 21.9359 KOps/s | |
test_add_pytree | 92.3120μs | 36.9713μs | 27.0480 KOps/s | 28.8264 KOps/s | |
test_add_td | 0.1660ms | 59.4363μs | 16.8247 KOps/s | 18.8392 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1269ms | 64.4846μs | 15.5076 KOps/s | 16.2509 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.3788ms | 0.1747ms | 5.7231 KOps/s | 5.8513 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1097ms | 46.6204μs | 21.4498 KOps/s | 22.3911 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2263ms | 0.1203ms | 8.3157 KOps/s | 8.6083 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 96.1020μs | 27.6081μs | 36.2212 KOps/s | 39.0490 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1238ms | 59.2666μs | 16.8729 KOps/s | 16.9562 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1745ms | 80.3302μs | 12.4486 KOps/s | 12.7856 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1503ms | 69.5154μs | 14.3853 KOps/s | 14.9062 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2254ms | 0.1071ms | 9.3359 KOps/s | 9.7384 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4506ms | 0.2181ms | 4.5858 KOps/s | 4.6398 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 99.9770μs | 45.6391μs | 21.9111 KOps/s | 23.6063 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4905ms | 67.5460μs | 14.8047 KOps/s | 15.1368 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2228ms | 0.1029ms | 9.7137 KOps/s | 9.8088 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3549ms | 0.2017ms | 4.9574 KOps/s | 5.0713 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3815ms | 0.2366ms | 4.2263 KOps/s | 4.2137 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1951ms | 0.1074ms | 9.3129 KOps/s | 9.6532 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1497ms | 68.8086μs | 14.5331 KOps/s | 15.6167 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1080ms | 47.7186μs | 20.9562 KOps/s | 21.7095 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3280ms | 0.1630ms | 6.1352 KOps/s | 6.3892 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2138ms | 0.1024ms | 9.7631 KOps/s | 9.4781 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 59.4710μs | 21.8198μs | 45.8300 KOps/s | 46.6617 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1647ms | 69.4427μs | 14.4004 KOps/s | 15.1096 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1575ms | 81.4561μs | 12.2766 KOps/s | 12.7649 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1492ms | 68.9261μs | 14.5083 KOps/s | 14.5816 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3999ms | 0.2071ms | 4.8286 KOps/s | 4.8885 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5562ms | 1.3230ms | 755.8743 Ops/s | 769.9825 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3267ms | 0.2036ms | 4.9124 KOps/s | 4.8928 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9664ms | 0.7714ms | 1.2964 KOps/s | 1.2957 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5896ms | 0.4503ms | 2.2207 KOps/s | 2.2348 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9398ms | 2.7444ms | 364.3729 Ops/s | 386.9289 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1279ms | 36.1863μs | 27.6348 KOps/s | 28.2233 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4925ms | 33.8353μs | 29.5549 KOps/s | 30.6823 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 76.5920μs | 29.2609μs | 34.1753 KOps/s | 35.0984 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 66.5840μs | 23.2942μs | 42.9292 KOps/s | 44.3147 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 69.0580μs | 29.4407μs | 33.9666 KOps/s | 33.2241 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 73.8180μs | 23.1663μs | 43.1661 KOps/s | 44.2538 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1177ms | 52.3933μs | 19.0864 KOps/s | 19.6162 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5472ms | 20.6039μs | 48.5345 KOps/s | 49.6388 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 90.4890μs | 44.5429μs | 22.4503 KOps/s | 22.8648 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 57.1260μs | 19.0302μs | 52.5479 KOps/s | 54.4912 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1243ms | 45.0494μs | 22.1979 KOps/s | 22.4582 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.6780μs | 19.3021μs | 51.8077 KOps/s | 54.2601 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1081ms | 52.5680μs | 19.0230 KOps/s | 18.9562 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9948ms | 20.3944μs | 49.0332 KOps/s | 49.7878 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1078ms | 45.0412μs | 22.2019 KOps/s | 22.5061 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.2868ms | 19.2248μs | 52.0163 KOps/s | 54.1987 KOps/s | |
test_compile_indexing[int-pytree-compile] | 99.3850μs | 44.9065μs | 22.2685 KOps/s | 22.2967 KOps/s | |
test_compile_indexing[int-pytree-eager] | 68.6380μs | 19.1285μs | 52.2780 KOps/s | 53.3176 KOps/s | |
test_mod_add[eager] | 74.3280μs | 35.3653μs | 28.2763 KOps/s | 30.0758 KOps/s | |
test_mod_add[compile] | 0.1122ms | 48.9510μs | 20.4286 KOps/s | 20.8451 KOps/s | |
test_mod_add[compile-overhead] | 96.6010μs | 49.6721μs | 20.1320 KOps/s | 20.7445 KOps/s | |
test_mod_wrap[eager] | 0.4203ms | 0.2259ms | 4.4275 KOps/s | 4.6031 KOps/s | |
test_mod_wrap[compile] | 0.3014ms | 0.2077ms | 4.8149 KOps/s | 4.9218 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4402ms | 0.2064ms | 4.8441 KOps/s | 4.9275 KOps/s | |
test_mod_wrap_and_backward[eager] | 16.3471ms | 11.5209ms | 86.7991 Ops/s | 94.8103 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.4413ms | 12.5820ms | 79.4789 Ops/s | 80.9722 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 15.8466ms | 12.3760ms | 80.8015 Ops/s | 78.7524 Ops/s | |
test_seq_add[eager] | 0.2755ms | 0.1191ms | 8.3947 KOps/s | 8.8965 KOps/s | |
test_seq_add[compile] | 0.1340ms | 64.4732μs | 15.5103 KOps/s | 16.2601 KOps/s | |
test_seq_add[compile-overhead] | 0.1233ms | 62.9857μs | 15.8766 KOps/s | 16.7808 KOps/s | |
test_seq_wrap[eager] | 0.6286ms | 0.4537ms | 2.2042 KOps/s | 2.2957 KOps/s | |
test_seq_wrap[compile] | 0.4257ms | 0.2293ms | 4.3614 KOps/s | 4.3436 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4254ms | 0.2305ms | 4.3375 KOps/s | 4.3941 KOps/s | |
test_func_call_runtime[False-eager] | 0.8140ms | 0.5527ms | 1.8093 KOps/s | 1.8910 KOps/s | |
test_func_call_runtime[False-compile] | 0.5659ms | 0.4275ms | 2.3393 KOps/s | 2.3878 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8931ms | 0.4345ms | 2.3013 KOps/s | 2.3181 KOps/s | |
test_func_call_runtime[True-eager] | 1.0335ms | 0.7760ms | 1.2887 KOps/s | 1.3385 KOps/s | |
test_func_call_runtime[True-compile] | 0.7720ms | 0.4687ms | 2.1334 KOps/s | 2.2109 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8841ms | 0.4729ms | 2.1145 KOps/s | 2.2011 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6679ms | 0.5548ms | 1.8026 KOps/s | 1.9188 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5132ms | 0.4263ms | 2.3457 KOps/s | 2.4062 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5191ms | 0.4258ms | 2.3487 KOps/s | 2.3900 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4706ms | 0.9260ms | 1.0799 KOps/s | 1.1404 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6915ms | 0.4953ms | 2.0188 KOps/s | 2.0845 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9018ms | 0.4960ms | 2.0161 KOps/s | 2.0953 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.3595ms | 1.8947ms | 527.7982 Ops/s | 531.7989 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0478ms | 0.5190ms | 1.9269 KOps/s | 1.9341 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7556ms | 0.5222ms | 1.9148 KOps/s | 1.9193 KOps/s | |
test_distributed | 0.2607ms | 0.1290ms | 7.7492 KOps/s | 7.9009 KOps/s | |
test_tdmodule | 85.8000μs | 27.7469μs | 36.0401 KOps/s | 38.3294 KOps/s | |
test_tdmodule_dispatch | 94.5360μs | 50.8953μs | 19.6482 KOps/s | 21.0171 KOps/s | |
test_tdseq | 59.3810μs | 31.0231μs | 32.2341 KOps/s | 33.3118 KOps/s | |
test_tdseq_dispatch | 94.7960μs | 57.1598μs | 17.4948 KOps/s | 18.2374 KOps/s | |
test_instantiation_functorch | 1.7987ms | 1.5517ms | 644.4463 Ops/s | 655.2836 Ops/s | |
test_exec_functorch | 0.4483ms | 0.1898ms | 5.2697 KOps/s | 5.6024 KOps/s | |
test_exec_functional_call | 0.4090ms | 0.1818ms | 5.5008 KOps/s | 5.8088 KOps/s | |
test_exec_td_decorator | 0.4531ms | 0.2445ms | 4.0898 KOps/s | 4.3811 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0259ms | 0.6622ms | 1.5102 KOps/s | 1.5415 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9605ms | 0.6619ms | 1.5108 KOps/s | 1.5532 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8201ms | 0.5303ms | 1.8857 KOps/s | 1.9356 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7623ms | 0.5295ms | 1.8886 KOps/s | 1.9369 KOps/s | |
test_to_module_speed[True] | 2.2096ms | 1.3738ms | 727.9309 Ops/s | 747.3703 Ops/s | |
test_to_module_speed[False] | 1.7062ms | 1.3439ms | 744.1123 Ops/s | 768.0111 Ops/s | |
test_tc_init | 93.7640μs | 49.8490μs | 20.0606 KOps/s | 21.6590 KOps/s | |
test_tc_init_nested | 0.1693ms | 0.1001ms | 9.9949 KOps/s | 11.1928 KOps/s | |
test_tc_first_layer_tensor | 24.1350μs | 1.4934μs | 669.6313 KOps/s | 661.4411 KOps/s | |
test_tc_first_layer_nontensor | 24.8770μs | 4.7462μs | 210.6951 KOps/s | 218.6528 KOps/s | |
test_tc_second_layer_tensor | 18.8950μs | 2.8212μs | 354.4581 KOps/s | 357.8768 KOps/s | |
test_tc_second_layer_nontensor | 24.6760μs | 6.1016μs | 163.8911 KOps/s | 169.0527 KOps/s | |
test_unbind | 0.2106s | 12.9726ms | 77.0855 Ops/s | 80.8176 Ops/s | |
test_full_like | 7.7038ms | 6.9021ms | 144.8838 Ops/s | 146.1678 Ops/s | |
test_zeros_like | 3.0117ms | 2.6713ms | 374.3545 Ops/s | 366.6089 Ops/s | |
test_ones_like | 3.5745ms | 3.0980ms | 322.7940 Ops/s | 321.3847 Ops/s | |
test_clone | 5.5171ms | 4.8442ms | 206.4325 Ops/s | 207.0979 Ops/s | |
test_squeeze | 58.9200μs | 12.8880μs | 77.5917 KOps/s | 82.6628 KOps/s | |
test_unsqueeze | 0.1868ms | 95.4884μs | 10.4725 KOps/s | 11.0306 KOps/s | |
test_split | 0.5545ms | 0.1987ms | 5.0315 KOps/s | 5.1146 KOps/s | |
test_permute | 0.3157ms | 0.2093ms | 4.7772 KOps/s | 4.8293 KOps/s | |
test_stack | 28.1191ms | 24.0537ms | 41.5737 Ops/s | 40.4973 Ops/s | |
test_cat | 27.9462ms | 23.9775ms | 41.7058 Ops/s | 41.1419 Ops/s |
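
The Change column is empty in this capture, so the difference between a run and Repo HEAD has to be read off the two Ops columns by hand. The sketch below shows one way to recover it from a row of the table. It assumes that Change means the relative difference of Ops against Ops on Repo HEAD, which the page itself does not confirm; `parse_ops` and `relative_change` are hypothetical helper names, not part of tensordict or the benchmark bot.

```python
import re

# Multipliers for the throughput units that appear in the Ops columns.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell):
    """Convert a cell such as '46.7257 KOps/s' to operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*([KM]?Ops/s)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognised throughput cell: {cell!r}")
    value, unit = match.groups()
    return float(value) * _UNIT_SCALE[unit]


def relative_change(row):
    """Return (test name, percent change of Ops versus Ops on Repo HEAD).

    Assumption: Change = (Ops - HEAD) / HEAD; the bot's own definition is
    not shown on the page.
    """
    cells = [c.strip() for c in row.split("|")]
    name = cells[0]
    ops = parse_ops(cells[3])   # fourth column: Ops for this PR
    head = parse_ops(cells[4])  # fifth column: Ops on Repo HEAD
    return name, (ops - head) / head * 100.0


# Two rows copied verbatim from the table above.
rows = [
    "test_plain_set_nested | 47.4380μs | 21.4015μs | 46.7257 KOps/s | 49.6161 KOps/s | |",
    "test_membership | 5.3284μs | 0.7238μs | 1.3816 MOps/s | 1.1438 MOps/s | |",
]
for row in rows:
    name, pct = relative_change(row)
    print(f"{name}: {pct:+.2f}%")
```

A negative value means the PR run was slower than Repo HEAD for that test. Single-run differences of a few percent on shared CI hardware are often noise, so the sign alone should not be read as a regression.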

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 51.2410μs | 11.3901μs | 87.7958 KOps/s | 74.9412 KOps/s | |
test_plain_set_stack_nested | 50.9910μs | 11.6182μs | 86.0716 KOps/s | 74.0522 KOps/s | |
test_plain_set_nested_inplace | 40.4110μs | 12.5977μs | 79.3795 KOps/s | 69.1061 KOps/s | |
test_plain_set_stack_nested_inplace | 41.8710μs | 12.5783μs | 79.5018 KOps/s | 69.0149 KOps/s | |
test_items | 34.2200μs | 2.9089μs | 343.7744 KOps/s | 334.7314 KOps/s | |
test_items_nested | 0.3993ms | 0.3601ms | 2.7771 KOps/s | 2.8272 KOps/s | |
test_items_nested_locked | 0.4020ms | 0.3613ms | 2.7678 KOps/s | 2.8161 KOps/s | |
test_items_nested_leaf | 84.8710μs | 58.1056μs | 17.2100 KOps/s | 17.2384 KOps/s | |
test_items_stack_nested | 0.4007ms | 0.3631ms | 2.7543 KOps/s | 2.8301 KOps/s | |
test_items_stack_nested_leaf | 83.5820μs | 59.1667μs | 16.9014 KOps/s | 16.5065 KOps/s | |
test_items_stack_nested_locked | 0.3915ms | 0.3604ms | 2.7743 KOps/s | 2.7730 KOps/s | |
test_keys | 26.5610μs | 3.4707μs | 288.1282 KOps/s | 287.7319 KOps/s | |
test_keys_nested | 0.1077ms | 81.4496μs | 12.2775 KOps/s | 12.2149 KOps/s | |
test_keys_nested_locked | 0.7686ms | 87.4425μs | 11.4361 KOps/s | 11.3565 KOps/s | |
test_keys_nested_leaf | 0.1072ms | 72.5706μs | 13.7797 KOps/s | 13.7909 KOps/s | |
test_keys_stack_nested | 0.1141ms | 81.9296μs | 12.2056 KOps/s | 11.9734 KOps/s | |
test_keys_stack_nested_leaf | 0.1057ms | 73.3212μs | 13.6386 KOps/s | 13.4472 KOps/s | |
test_keys_stack_nested_locked | 0.1278ms | 87.7609μs | 11.3946 KOps/s | 11.0968 KOps/s | |
test_values | 8.0335μs | 0.8518μs | 1.1740 MOps/s | 1.1750 MOps/s | |
test_values_nested | 0.1194ms | 34.4298μs | 29.0446 KOps/s | 29.0547 KOps/s | |
test_values_nested_locked | 60.5810μs | 36.0736μs | 27.7211 KOps/s | 27.8276 KOps/s | |
test_values_nested_leaf | 78.2910μs | 39.1714μs | 25.5288 KOps/s | 25.9771 KOps/s | |
test_values_stack_nested | 76.9610μs | 34.8549μs | 28.6904 KOps/s | 28.5162 KOps/s | |
test_values_stack_nested_leaf | 72.8210μs | 39.4899μs | 25.3229 KOps/s | 25.5593 KOps/s | |
test_values_stack_nested_locked | 61.6910μs | 36.4452μs | 27.4385 KOps/s | 27.1329 KOps/s | |
test_membership | 1.6560μs | 0.5077μs | 1.9696 MOps/s | 1.9517 MOps/s | |
test_membership_nested | 16.5805μs | 2.0431μs | 489.4571 KOps/s | 478.0138 KOps/s | |
test_membership_nested_leaf | 14.8600μs | 2.0791μs | 480.9676 KOps/s | 493.0967 KOps/s | |
test_membership_stacked_nested | 39.8310μs | 2.1099μs | 473.9570 KOps/s | 475.3271 KOps/s | |
test_membership_stacked_nested_leaf | 34.3110μs | 2.0992μs | 476.3774 KOps/s | 473.9294 KOps/s | |
test_membership_nested_last | 40.5610μs | 3.1003μs | 322.5521 KOps/s | 317.3517 KOps/s | |
test_membership_nested_leaf_last | 31.9610μs | 3.1200μs | 320.5119 KOps/s | 317.3139 KOps/s | |
test_membership_stacked_nested_last | 44.1410μs | 3.1664μs | 315.8124 KOps/s | 275.1589 KOps/s | |
test_membership_stacked_nested_leaf_last | 34.8710μs | 3.1425μs | 318.2209 KOps/s | 277.9926 KOps/s | |
test_nested_getleaf | 31.7210μs | 6.1736μs | 161.9793 KOps/s | 160.3302 KOps/s | |
test_nested_get | 61.4620μs | 5.8435μs | 171.1310 KOps/s | 170.9814 KOps/s | |
test_stacked_getleaf | 32.4010μs | 6.1803μs | 161.8032 KOps/s | 162.5764 KOps/s | |
test_stacked_get | 27.2200μs | 5.8356μs | 171.3631 KOps/s | 171.4996 KOps/s | |
test_nested_getitemleaf | 28.6100μs | 6.2449μs | 160.1315 KOps/s | 158.7396 KOps/s | |
test_nested_getitem | 30.8210μs | 5.9663μs | 167.6079 KOps/s | 168.6622 KOps/s | |
test_stacked_getitemleaf | 35.3410μs | 6.2044μs | 161.1755 KOps/s | 160.4387 KOps/s | |
test_stacked_getitem | 32.1110μs | 5.8946μs | 169.6460 KOps/s | 168.7471 KOps/s | |
test_lock_nested | 0.7618ms | 0.3730ms | 2.6811 KOps/s | 2.5979 KOps/s | |
test_lock_stack_nested | 0.3726ms | 0.3443ms | 2.9041 KOps/s | 2.7985 KOps/s | |
test_unlock_nested | 0.6060ms | 0.3127ms | 3.1982 KOps/s | 3.0457 KOps/s | |
test_unlock_stack_nested | 0.3135ms | 0.2811ms | 3.5573 KOps/s | 3.3894 KOps/s | |
test_flatten_speed | 0.1305ms | 76.1718μs | 13.1282 KOps/s | 13.1445 KOps/s | |
test_unflatten_speed | 0.3515ms | 0.3197ms | 3.1275 KOps/s | 3.0878 KOps/s | |
test_common_ops | 95.2309ms | 0.6562ms | 1.5238 KOps/s | 1.4885 KOps/s | |
test_creation | 0.1024ms | 1.7413μs | 574.2704 KOps/s | 565.7573 KOps/s | |
test_creation_empty | 34.5500μs | 6.9486μs | 143.9140 KOps/s | 93.0659 KOps/s | |
test_creation_nested_1 | 31.2300μs | 8.6117μs | 116.1204 KOps/s | 80.6591 KOps/s | |
test_creation_nested_2 | 34.3010μs | 11.3114μs | 88.4061 KOps/s | 65.9481 KOps/s | |
test_clone | 75.8120μs | 10.4065μs | 96.0938 KOps/s | 89.2401 KOps/s | |
test_getitem[int] | 2.2641ms | 10.6176μs | 94.1829 KOps/s | 87.6795 KOps/s | |
test_getitem[slice_int] | 0.1092ms | 20.3107μs | 49.2351 KOps/s | 44.6123 KOps/s | |
test_getitem[range] | 0.1339ms | 38.0726μs | 26.2656 KOps/s | 25.2452 KOps/s | |
test_getitem[tuple] | 0.1066ms | 18.0622μs | 55.3642 KOps/s | 50.3992 KOps/s | |
test_getitem[list] | 0.2043ms | 34.6095μs | 28.8938 KOps/s | 28.5676 KOps/s | |
test_setitem_dim[int] | 52.0410μs | 20.2068μs | 49.4882 KOps/s | 48.0335 KOps/s | |
test_setitem_dim[slice_int] | 75.1920μs | 38.6150μs | 25.8966 KOps/s | 24.9038 KOps/s | |
test_setitem_dim[range] | 85.3310μs | 53.2256μs | 18.7879 KOps/s | 18.1487 KOps/s | |
test_setitem_dim[tuple] | 51.4810μs | 32.4804μs | 30.7878 KOps/s | 28.9878 KOps/s | |
test_setitem | 83.0420μs | 14.0810μs | 71.0178 KOps/s | 57.7474 KOps/s | |
test_set | 84.6510μs | 13.7445μs | 72.7565 KOps/s | 59.3364 KOps/s | |
test_set_shared | 1.6713ms | 0.1516ms | 6.5941 KOps/s | 6.5254 KOps/s | |
test_update | 0.4980ms | 15.8051μs | 63.2707 KOps/s | 47.7210 KOps/s | |
test_update_nested | 81.3520μs | 21.4688μs | 46.5793 KOps/s | 37.3340 KOps/s | |
test_update__nested | 0.4967ms | 25.2294μs | 39.6363 KOps/s | 37.9863 KOps/s | |
test_set_nested | 89.4620μs | 14.8902μs | 67.1585 KOps/s | 54.6283 KOps/s | |
test_set_nested_new | 78.9710μs | 16.9519μs | 58.9905 KOps/s | 48.5632 KOps/s | |
test_select | 0.2253ms | 29.6654μs | 33.7093 KOps/s | 30.9766 KOps/s | |
test_select_nested | 78.7120μs | 43.4638μs | 23.0077 KOps/s | 22.6067 KOps/s | |
test_exclude_nested | 96.4720μs | 62.6069μs | 15.9727 KOps/s | 15.7629 KOps/s | |
test_empty[True] | 0.3559ms | 0.2863ms | 3.4924 KOps/s | 3.4242 KOps/s | |
test_empty[False] | 4.4081μs | 0.8228μs | 1.2153 MOps/s | 1.2167 MOps/s | |
test_to | 88.3320μs | 56.6154μs | 17.6630 KOps/s | 17.7518 KOps/s | |
test_to_nonblocking | 0.1137ms | 51.2979μs | 19.4940 KOps/s | 20.4499 KOps/s | |
test_unbind_speed | 0.2853ms | 0.2297ms | 4.3536 KOps/s | 3.9943 KOps/s | |
test_unbind_speed_stack0 | 0.2780ms | 0.2354ms | 4.2477 KOps/s | 4.0004 KOps/s | |
test_unbind_speed_stack1 | 94.7563ms | 0.6663ms | 1.5008 KOps/s | 1.4566 KOps/s | |
test_split | 96.4302ms | 1.7274ms | 578.8950 Ops/s | 610.1147 Ops/s | |
test_chunk | 1.5537ms | 1.4484ms | 690.4182 Ops/s | 608.0736 Ops/s | |
test_consolidate[False-None] | 98.9588ms | 2.9460ms | 339.4469 Ops/s | 326.1982 Ops/s | |
test_consolidate[default-None] | 1.7630ms | 1.6804ms | 595.0906 Ops/s | 569.7617 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8197ms | 1.7445ms | 573.2326 Ops/s | 562.8446 Ops/s | |
test_consolidate_njt[False-None] | 6.7091ms | 6.6173ms | 151.1201 Ops/s | 148.8000 Ops/s | |
test_to[False-False-None] | 1.8126ms | 1.7161ms | 582.7267 Ops/s | 582.0306 Ops/s | |
test_to[True-False-None] | 1.5832ms | 1.3658ms | 732.1473 Ops/s | 723.8304 Ops/s | |
test_to[within-False-None] | 4.4163ms | 4.1696ms | 239.8283 Ops/s | 233.4004 Ops/s | |
test_to[True-default-None] | 5.7947ms | 5.6008ms | 178.5460 Ops/s | 182.5904 Ops/s | |
test_to_njt[False-False-None] | 7.4023ms | 7.2560ms | 137.8174 Ops/s | 140.9465 Ops/s | |
test_to_njt[True-False-None] | 5.9809ms | 5.6305ms | 177.6043 Ops/s | 177.4001 Ops/s | |
test_to_njt[within-False-None] | 12.8470ms | 12.2884ms | 81.3775 Ops/s | 79.8251 Ops/s | |
test_creation[device0] | 0.3786ms | 82.9629μs | 12.0536 KOps/s | 12.1123 KOps/s | |
test_creation_from_tensor | 0.6814ms | 87.0462μs | 11.4881 KOps/s | 11.2524 KOps/s | |
test_add_one[memmap_tensor0] | 0.3137ms | 6.7145μs | 148.9307 KOps/s | 139.4368 KOps/s | |
test_contiguous[memmap_tensor0] | 1.7335μs | 0.4109μs | 2.4338 MOps/s | 2.3988 MOps/s | |
test_stack[memmap_tensor0] | 31.3410μs | 4.4811μs | 223.1571 KOps/s | 205.8992 KOps/s | |
test_memmaptd_index | 1.5587ms | 0.2528ms | 3.9564 KOps/s | 3.7432 KOps/s | |
test_memmaptd_index_astensor | 0.6121ms | 0.3126ms | 3.1993 KOps/s | 3.0574 KOps/s | |
test_memmaptd_index_op | 0.9760ms | 0.5667ms | 1.7647 KOps/s | 1.5297 KOps/s | |
test_serialize_model | 0.1311s | 0.1305s | 7.6628 Ops/s | 7.6618 Ops/s | |
test_serialize_model_pickle | 1.3465s | 1.2130s | 0.8244 Ops/s | 0.8220 Ops/s | |
test_serialize_weights | 0.1317s | 0.1304s | 7.6685 Ops/s | 7.6722 Ops/s | |
test_serialize_weights_returnearly | 0.5486s | 74.5308ms | 13.4173 Ops/s | 13.6022 Ops/s | |
test_serialize_weights_pickle | 1.3767s | 1.2167s | 0.8219 Ops/s | 0.8213 Ops/s | |
test_reshape_pytree | 51.9910μs | 22.2425μs | 44.9590 KOps/s | 42.4182 KOps/s | |
test_reshape_td | 60.7210μs | 27.3423μs | 36.5733 KOps/s | 33.7417 KOps/s | |
test_view_pytree | 78.3020μs | 22.0398μs | 45.3725 KOps/s | 43.9795 KOps/s | |
test_view_td | 67.7410μs | 31.7098μs | 31.5360 KOps/s | 29.8990 KOps/s | |
test_unbind_pytree | 61.2910μs | 27.9173μs | 35.8201 KOps/s | 34.8907 KOps/s | |
test_unbind_td | 0.8457ms | 36.8378μs | 27.1460 KOps/s | 26.3660 KOps/s | |
test_split_pytree | 0.1101ms | 29.5375μs | 33.8553 KOps/s | 32.6649 KOps/s | |
test_split_td | 1.0673ms | 38.0796μs | 26.2608 KOps/s | 24.2839 KOps/s | |
test_add_pytree | 78.1320μs | 34.0604μs | 29.3596 KOps/s | 27.4677 KOps/s | |
test_add_td | 95.9320μs | 45.2812μs | 22.0842 KOps/s | 18.0157 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1738ms | 0.1202ms | 8.3166 KOps/s | 7.9771 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2222ms | 0.1330ms | 7.5165 KOps/s | 7.4779 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2387ms | 94.5861μs | 10.5724 KOps/s | 10.0908 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.9321ms | 0.1484ms | 6.7390 KOps/s | 6.4216 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 73.2510μs | 22.7796μs | 43.8989 KOps/s | 42.2244 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 77.6020μs | 28.9966μs | 34.4868 KOps/s | 33.7056 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2287ms | 62.6954μs | 15.9501 KOps/s | 15.0957 KOps/s | |
test_compile_copy_nested[pytree-eager] | 87.0520μs | 49.1228μs | 20.3572 KOps/s | 20.0388 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2011ms | 0.1417ms | 7.0593 KOps/s | 6.9659 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3131ms | 0.2194ms | 4.5573 KOps/s | 4.5901 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1385ms | 96.9870μs | 10.3107 KOps/s | 10.0443 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.3512ms | 56.3090μs | 17.7592 KOps/s | 17.6236 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1748ms | 0.1352ms | 7.3965 KOps/s | 7.3123 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5153ms | 0.4788ms | 2.0888 KOps/s | 1.9974 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3838ms | 0.2613ms | 3.8270 KOps/s | 3.8099 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2785ms | 0.1551ms | 6.4493 KOps/s | 7.0180 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2109ms | 73.1584μs | 13.6690 KOps/s | 14.1454 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2463ms | 0.1074ms | 9.3074 KOps/s | 9.9304 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5259ms | 0.4081ms | 2.4502 KOps/s | 2.4438 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2510ms | 0.1430ms | 6.9937 KOps/s | 7.4254 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1298ms | 19.5838μs | 51.0626 KOps/s | 53.2676 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1212ms | 31.7081μs | 31.5377 KOps/s | 32.3901 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1793ms | 71.7618μs | 13.9350 KOps/s | 14.2532 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1622ms | 52.4102μs | 19.0803 KOps/s | 19.4518 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6850ms | 0.4065ms | 2.4599 KOps/s | 2.2015 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8758ms | 2.6813ms | 372.9533 Ops/s | 371.6581 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6258ms | 0.4369ms | 2.2889 KOps/s | 2.2593 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8500ms | 2.6267ms | 380.6993 Ops/s | 364.6643 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1719ms | 0.1136ms | 8.8060 KOps/s | 8.2666 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5686ms | 78.0388μs | 12.8141 KOps/s | 11.6964 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.4443ms | 0.1107ms | 9.0316 KOps/s | 8.9287 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1319ms | 70.4343μs | 14.1976 KOps/s | 13.6401 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2046ms | 0.1122ms | 8.9147 KOps/s | 8.7973 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1228ms | 71.1898μs | 14.0469 KOps/s | 13.6454 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1555ms | 0.1032ms | 9.6897 KOps/s | 9.7113 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1481ms | 17.3144μs | 57.7555 KOps/s | 44.8490 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1291ms | 95.6336μs | 10.4566 KOps/s | 9.6549 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 62.8010μs | 15.9352μs | 62.7540 KOps/s | 59.9834 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1710ms | 95.8995μs | 10.4276 KOps/s | 9.6322 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1700ms | 15.9151μs | 62.8335 KOps/s | 60.7321 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1572ms | 0.1042ms | 9.6008 KOps/s | 9.4928 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6361ms | 17.1517μs | 58.3034 KOps/s | 54.6113 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1754ms | 96.4181μs | 10.3715 KOps/s | 9.7832 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 82.8510μs | 15.7856μs | 63.3491 KOps/s | 60.8174 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2485ms | 95.7865μs | 10.4399 KOps/s | 9.7014 KOps/s | |
test_compile_indexing[int-pytree-eager] | 80.4310μs | 15.6154μs | 64.0392 KOps/s | 59.8874 KOps/s | |
test_mod_add[eager] | 0.1146ms | 37.0823μs | 26.9670 KOps/s | 23.0502 KOps/s | |
test_mod_add[compile] | 0.3310ms | 79.9230μs | 12.5120 KOps/s | 11.5759 KOps/s | |
test_mod_add[compile-overhead] | 0.3298ms | 0.1678ms | 5.9611 KOps/s | 5.6296 KOps/s | |
test_mod_wrap[eager] | 0.3915ms | 0.2511ms | 3.9832 KOps/s | 3.8920 KOps/s | |
test_mod_wrap[compile] | 0.6858ms | 0.2859ms | 3.4979 KOps/s | 3.4120 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1198ms | 3.7329ms | 267.8858 Ops/s | 278.6147 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4775ms | 1.3680ms | 730.9947 Ops/s | 700.2438 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4219ms | 1.2764ms | 783.4603 Ops/s | 770.9679 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4613ms | 0.9685ms | 1.0326 KOps/s | 1.0405 KOps/s | |
test_seq_add[eager] | 0.1821ms | 0.1187ms | 8.4278 KOps/s | 8.0651 KOps/s | |
test_seq_add[compile] | 0.1357ms | 89.1953μs | 11.2113 KOps/s | 11.0738 KOps/s | |
test_seq_add[compile-overhead] | 0.1747ms | 0.1275ms | 7.8403 KOps/s | 7.6224 KOps/s | |
test_seq_wrap[eager] | 0.5376ms | 0.4127ms | 2.4231 KOps/s | 2.2618 KOps/s | |
test_seq_wrap[compile] | 0.4044ms | 0.3075ms | 3.2522 KOps/s | 3.2581 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3105ms | 0.2298ms | 4.3516 KOps/s | 4.3871 KOps/s | |
test_func_call_runtime[False-eager] | 0.9081ms | 0.7845ms | 1.2747 KOps/s | 1.3216 KOps/s | |
test_func_call_runtime[False-compile] | 1.1697ms | 0.7535ms | 1.3272 KOps/s | 1.3116 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4144ms | 0.3686ms | 2.7126 KOps/s | 2.7077 KOps/s | |
test_func_call_runtime[True-eager] | 1.3429ms | 0.9089ms | 1.1002 KOps/s | 1.0829 KOps/s | |
test_func_call_runtime[True-compile] | 1.1926ms | 0.7716ms | 1.2960 KOps/s | 1.2804 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8049ms | 0.3874ms | 2.5811 KOps/s | 2.5278 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.2354ms | 0.7904ms | 1.2652 KOps/s | 1.2931 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1764ms | 0.7533ms | 1.3275 KOps/s | 1.2203 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4247ms | 0.3661ms | 2.7312 KOps/s | 2.6846 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1739ms | 1.0128ms | 987.3544 Ops/s | 936.5911 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.2186ms | 0.8001ms | 1.2498 KOps/s | 1.1688 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4814ms | 0.4177ms | 2.3942 KOps/s | 2.3095 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5804ms | 2.0790ms | 480.9908 Ops/s | 472.6664 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.2219ms | 0.8099ms | 1.2348 KOps/s | 1.1463 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4623ms | 0.4146ms | 2.4121 KOps/s | 2.3790 KOps/s | |
test_distributed | 13.0008ms | 0.3033ms | 3.2965 KOps/s | 7.2122 KOps/s | |
test_tdmodule | 0.1151ms | 20.2355μs | 49.4180 KOps/s | 46.4755 KOps/s | |
test_tdmodule_dispatch | 65.0010μs | 36.3752μs | 27.4912 KOps/s | 25.6376 KOps/s | |
test_tdseq | 42.0810μs | 21.5782μs | 46.3431 KOps/s | 44.6457 KOps/s | |
test_tdseq_dispatch | 64.1320μs | 39.7370μs | 25.1654 KOps/s | 23.7234 KOps/s | |
test_instantiation_functorch | 1.6850ms | 1.5702ms | 636.8446 Ops/s | 627.4584 Ops/s | |
test_exec_functorch | 0.1874ms | 0.1466ms | 6.8191 KOps/s | 6.6331 KOps/s | |
test_exec_functional_call | 0.2141ms | 0.1404ms | 7.1222 KOps/s | 7.1984 KOps/s | |
test_exec_td_decorator | 0.3860ms | 0.1884ms | 5.3090 KOps/s | 5.2549 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9220ms | 0.7170ms | 1.3947 KOps/s | 1.4265 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8262ms | 0.7151ms | 1.3983 KOps/s | 1.4263 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7331ms | 0.6219ms | 1.6080 KOps/s | 1.6549 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7515ms | 0.6263ms | 1.5967 KOps/s | 1.6531 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.5359ms | 19.2194ms | 52.0308 Ops/s | 51.3963 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.0257ms | 19.3468ms | 51.6883 Ops/s | 51.3800 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.7811ms | 19.1743ms | 52.1531 Ops/s | 51.8237 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.0069ms | 19.2944ms | 51.8286 Ops/s | 51.7648 Ops/s | |
test_to_module_speed[True] | 1.5176ms | 0.9609ms | 1.0407 KOps/s | 1.0352 KOps/s | |
test_to_module_speed[False] | 1.5032ms | 0.9441ms | 1.0592 KOps/s | 1.0393 KOps/s | |
test_tc_init | 70.1010μs | 34.4960μs | 28.9889 KOps/s | 24.8883 KOps/s | |
test_tc_init_nested | 0.1663ms | 71.5156μs | 13.9830 KOps/s | 12.4232 KOps/s | |
test_tc_first_layer_tensor | 5.4159μs | 0.7103μs | 1.4078 MOps/s | 1.4432 MOps/s | |
test_tc_first_layer_nontensor | 19.8210μs | 2.2860μs | 437.4534 KOps/s | 451.1323 KOps/s | |
test_tc_second_layer_tensor | 10.6425μs | 1.4207μs | 703.8661 KOps/s | 717.2024 KOps/s | |
test_tc_second_layer_nontensor | 29.8010μs | 3.0248μs | 330.5975 KOps/s | 334.7627 KOps/s | |
test_unbind | 0.2400s | 10.2303ms | 97.7491 Ops/s | 142.5067 Ops/s | |
test_full_like | 9.7193ms | 9.4912ms | 105.3611 Ops/s | 103.4393 Ops/s | |
test_zeros_like | 5.6034ms | 4.3781ms | 228.4110 Ops/s | 114.6767 Ops/s | |
test_ones_like | 5.0253ms | 4.3783ms | 228.3972 Ops/s | 229.5048 Ops/s | |
test_clone | 7.0618ms | 6.7590ms | 147.9511 Ops/s | 147.8579 Ops/s | |
test_squeeze | 85.6220μs | 10.5411μs | 94.8669 KOps/s | 102.7312 KOps/s | |
test_unsqueeze | 0.1281ms | 73.7514μs | 13.5591 KOps/s | 13.2966 KOps/s | |
test_split | 0.3685ms | 0.1604ms | 6.2327 KOps/s | 6.0253 KOps/s | |
test_permute | 0.2275ms | 0.1837ms | 5.4425 KOps/s | 5.6306 KOps/s | |
test_stack | 53.3286ms | 51.6977ms | 19.3432 Ops/s | 19.7840 Ops/s | |
test_cat | 51.6356ms | 51.3598ms | 19.4705 Ops/s | 19.4647 Ops/s |
vmoens added a commit that referenced this pull request on Jan 9, 2025
ghstack-source-id: 5d85310221eb18ca6d0b1c4a4a88557f3fe8819d
Pull Request resolved: #1174
Labels
BE: Better errors, logs, docs or test utils
CLA Signed: This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Stack from ghstack (oldest at bottom):