-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] ProbabilisticTensorDictModule.num_samples #1117
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 29, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 49.8130μs | 18.2524μs | 54.7872 KOps/s | 59.5307 KOps/s | |
test_plain_set_stack_nested | 43.2700μs | 18.0443μs | 55.4193 KOps/s | 58.5082 KOps/s | |
test_plain_set_nested_inplace | 71.8440μs | 19.4496μs | 51.4150 KOps/s | 53.7465 KOps/s | |
test_plain_set_stack_nested_inplace | 69.1290μs | 19.5045μs | 51.2703 KOps/s | 53.8101 KOps/s | |
test_items | 25.3370μs | 4.2051μs | 237.8049 KOps/s | 244.0356 KOps/s | |
test_items_nested | 0.7224ms | 0.3998ms | 2.5015 KOps/s | 2.3411 KOps/s | |
test_items_nested_locked | 0.5838ms | 0.3961ms | 2.5245 KOps/s | 2.3399 KOps/s | |
test_items_nested_leaf | 0.1318ms | 70.8946μs | 14.1054 KOps/s | 13.8725 KOps/s | |
test_items_stack_nested | 0.5253ms | 0.3971ms | 2.5185 KOps/s | 2.3377 KOps/s | |
test_items_stack_nested_leaf | 0.1339ms | 72.6385μs | 13.7668 KOps/s | 13.5332 KOps/s | |
test_items_stack_nested_locked | 0.5819ms | 0.3992ms | 2.5049 KOps/s | 2.3333 KOps/s | |
test_keys | 36.6080μs | 3.4956μs | 286.0704 KOps/s | 283.7522 KOps/s | |
test_keys_nested | 0.1962ms | 0.1404ms | 7.1224 KOps/s | 7.2584 KOps/s | |
test_keys_nested_locked | 1.9273ms | 0.1456ms | 6.8692 KOps/s | 6.9430 KOps/s | |
test_keys_nested_leaf | 0.1746ms | 0.1212ms | 8.2485 KOps/s | 8.4845 KOps/s | |
test_keys_stack_nested | 0.2384ms | 0.1413ms | 7.0777 KOps/s | 7.2652 KOps/s | |
test_keys_stack_nested_leaf | 0.1820ms | 0.1209ms | 8.2702 KOps/s | 8.4353 KOps/s | |
test_keys_stack_nested_locked | 0.2532ms | 0.1463ms | 6.8360 KOps/s | 6.9681 KOps/s | |
test_values | 9.5752μs | 1.0269μs | 973.8070 KOps/s | 955.0587 KOps/s | |
test_values_nested | 0.1104ms | 54.3817μs | 18.3885 KOps/s | 17.9714 KOps/s | |
test_values_nested_locked | 0.1046ms | 54.9767μs | 18.1895 KOps/s | 18.2341 KOps/s | |
test_values_nested_leaf | 0.1316ms | 60.1315μs | 16.6302 KOps/s | 16.1195 KOps/s | |
test_values_stack_nested | 0.1052ms | 54.9453μs | 18.1999 KOps/s | 17.9018 KOps/s | |
test_values_stack_nested_leaf | 0.1294ms | 59.9279μs | 16.6867 KOps/s | 16.4702 KOps/s | |
test_values_stack_nested_locked | 0.1029ms | 55.0845μs | 18.1539 KOps/s | 17.8535 KOps/s | |
test_membership | 2.5352μs | 0.7009μs | 1.4268 MOps/s | 1.3003 MOps/s | |
test_membership_nested | 22.9630μs | 2.8711μs | 348.2987 KOps/s | 328.9494 KOps/s | |
test_membership_nested_leaf | 41.9980μs | 2.8952μs | 345.3958 KOps/s | 326.6830 KOps/s | |
test_membership_stacked_nested | 24.0450μs | 2.8842μs | 346.7203 KOps/s | 322.3288 KOps/s | |
test_membership_stacked_nested_leaf | 42.3390μs | 2.8887μs | 346.1791 KOps/s | 336.5287 KOps/s | |
test_membership_nested_last | 35.9070μs | 4.1289μs | 242.1971 KOps/s | 233.7822 KOps/s | |
test_membership_nested_leaf_last | 49.5920μs | 4.1566μs | 240.5801 KOps/s | 230.7274 KOps/s | |
test_membership_stacked_nested_last | 29.2950μs | 4.2142μs | 237.2952 KOps/s | 200.6822 KOps/s | |
test_membership_stacked_nested_leaf_last | 24.3160μs | 4.1495μs | 240.9909 KOps/s | 200.5599 KOps/s | |
test_nested_getleaf | 54.4710μs | 10.5203μs | 95.0546 KOps/s | 92.3385 KOps/s | |
test_nested_get | 48.5400μs | 10.0593μs | 99.4108 KOps/s | 95.2812 KOps/s | |
test_stacked_getleaf | 53.7400μs | 10.5088μs | 95.1581 KOps/s | 92.7577 KOps/s | |
test_stacked_get | 34.6150μs | 10.0469μs | 99.5330 KOps/s | 95.4647 KOps/s | |
test_nested_getitemleaf | 46.9580μs | 11.0127μs | 90.8041 KOps/s | 90.3196 KOps/s | |
test_nested_getitem | 42.0780μs | 10.2715μs | 97.3566 KOps/s | 91.4464 KOps/s | |
test_stacked_getitemleaf | 38.6350μs | 10.9249μs | 91.5337 KOps/s | 86.9850 KOps/s | |
test_stacked_getitem | 50.2140μs | 10.2674μs | 97.3959 KOps/s | 93.6195 KOps/s | |
test_lock_nested | 3.3934ms | 0.4415ms | 2.2648 KOps/s | 2.2835 KOps/s | |
test_lock_stack_nested | 0.8127ms | 0.4090ms | 2.4448 KOps/s | 2.4547 KOps/s | |
test_unlock_nested | 0.6912ms | 0.3554ms | 2.8137 KOps/s | 2.8259 KOps/s | |
test_unlock_stack_nested | 0.4938ms | 0.3292ms | 3.0372 KOps/s | 3.0803 KOps/s | |
test_flatten_speed | 0.1749ms | 94.8840μs | 10.5392 KOps/s | 10.5460 KOps/s | |
test_unflatten_speed | 0.6622ms | 0.4929ms | 2.0288 KOps/s | 2.0153 KOps/s | |
test_common_ops | 4.4752ms | 0.7721ms | 1.2951 KOps/s | 1.3823 KOps/s | |
test_creation | 14.6680μs | 2.0689μs | 483.3409 KOps/s | 482.7608 KOps/s | |
test_creation_empty | 59.5780μs | 10.8541μs | 92.1313 KOps/s | 111.6712 KOps/s | |
test_creation_nested_1 | 44.0220μs | 13.7161μs | 72.9070 KOps/s | 83.7736 KOps/s | |
test_creation_nested_2 | 73.2090μs | 17.9946μs | 55.5722 KOps/s | 62.4367 KOps/s | |
test_clone | 55.8740μs | 12.6136μs | 79.2792 KOps/s | 76.1416 KOps/s | |
test_getitem[int] | 1.4250ms | 12.3882μs | 80.7218 KOps/s | 79.0219 KOps/s | |
test_getitem[slice_int] | 0.1446ms | 23.8752μs | 41.8844 KOps/s | 40.6862 KOps/s | |
test_getitem[range] | 0.1667ms | 47.8813μs | 20.8850 KOps/s | 21.3262 KOps/s | |
test_getitem[tuple] | 0.1276ms | 19.6911μs | 50.7843 KOps/s | 50.5768 KOps/s | |
test_getitem[list] | 0.1589ms | 43.5905μs | 22.9408 KOps/s | 24.1395 KOps/s | |
test_setitem_dim[int] | 59.7920μs | 25.4597μs | 39.2778 KOps/s | 41.6447 KOps/s | |
test_setitem_dim[slice_int] | 86.1910μs | 52.8998μs | 18.9037 KOps/s | 19.8143 KOps/s | |
test_setitem_dim[range] | 0.1247ms | 74.2751μs | 13.4635 KOps/s | 13.8407 KOps/s | |
test_setitem_dim[tuple] | 62.4060μs | 41.7061μs | 23.9773 KOps/s | 25.0621 KOps/s | |
test_setitem | 72.2550μs | 19.5952μs | 51.0330 KOps/s | 52.8904 KOps/s | |
test_set | 68.4580μs | 19.3141μs | 51.7755 KOps/s | 55.7450 KOps/s | |
test_set_shared | 4.1360ms | 0.1671ms | 5.9834 KOps/s | 6.0633 KOps/s | |
test_update | 0.1209ms | 22.2422μs | 44.9596 KOps/s | 51.0783 KOps/s | |
test_update_nested | 84.7380μs | 31.9217μs | 31.3267 KOps/s | 34.2815 KOps/s | |
test_update__nested | 1.0390ms | 31.5517μs | 31.6940 KOps/s | 31.9920 KOps/s | |
test_set_nested | 88.1150μs | 21.2252μs | 47.1137 KOps/s | 46.4948 KOps/s | |
test_set_nested_new | 84.3670μs | 25.7068μs | 38.9002 KOps/s | 41.2082 KOps/s | |
test_select | 91.5110μs | 43.2512μs | 23.1208 KOps/s | 24.6368 KOps/s | |
test_select_nested | 94.6870μs | 59.4393μs | 16.8239 KOps/s | 16.6718 KOps/s | |
test_exclude_nested | 0.1439ms | 79.1256μs | 12.6381 KOps/s | 12.3891 KOps/s | |
test_empty[True] | 0.5395ms | 0.3872ms | 2.5828 KOps/s | 2.6001 KOps/s | |
test_empty[False] | 10.6573μs | 1.2286μs | 813.9233 KOps/s | 834.9057 KOps/s | |
test_unbind_speed | 0.3662ms | 0.2572ms | 3.8878 KOps/s | 3.8885 KOps/s | |
test_unbind_speed_stack0 | 0.4439ms | 0.2573ms | 3.8869 KOps/s | 3.9457 KOps/s | |
test_unbind_speed_stack1 | 97.8296ms | 0.7619ms | 1.3126 KOps/s | 1.4431 KOps/s | |
test_split | 97.4792ms | 1.6947ms | 590.0741 Ops/s | 593.4278 Ops/s | |
test_chunk | 87.9642ms | 1.6717ms | 598.1798 Ops/s | 589.9872 Ops/s | |
test_consolidate_njt[False-None] | 8.2199ms | 7.9504ms | 125.7805 Ops/s | 123.4681 Ops/s | |
test_creation[device0] | 0.2287ms | 89.7671μs | 11.1399 KOps/s | 11.1468 KOps/s | |
test_creation_from_tensor | 3.3382ms | 93.1760μs | 10.7324 KOps/s | 10.6512 KOps/s | |
test_add_one[memmap_tensor0] | 0.1694ms | 5.0436μs | 198.2715 KOps/s | 205.8027 KOps/s | |
test_contiguous[memmap_tensor0] | 19.5760μs | 0.5182μs | 1.9298 MOps/s | 1.9479 MOps/s | |
test_stack[memmap_tensor0] | 27.7720μs | 3.4644μs | 288.6464 KOps/s | 301.3731 KOps/s | |
test_memmaptd_index | 1.0750ms | 0.2310ms | 4.3284 KOps/s | 4.3668 KOps/s | |
test_memmaptd_index_astensor | 0.5692ms | 0.3101ms | 3.2245 KOps/s | 3.2743 KOps/s | |
test_memmaptd_index_op | 0.9665ms | 0.5655ms | 1.7685 KOps/s | 1.8775 KOps/s | |
test_serialize_model | 0.1188s | 0.1146s | 8.7256 Ops/s | 7.5592 Ops/s | |
test_serialize_model_pickle | 0.4972s | 0.4023s | 2.4854 Ops/s | 2.5497 Ops/s | |
test_serialize_weights | 0.2147s | 0.1282s | 7.8014 Ops/s | 8.9413 Ops/s | |
test_serialize_weights_returnearly | 0.1781s | 0.1611s | 6.2086 Ops/s | 6.5059 Ops/s | |
test_serialize_weights_pickle | 0.6227s | 0.4369s | 2.2891 Ops/s | 2.4428 Ops/s | |
test_serialize_weights_filesystem | 0.1501s | 0.1422s | 7.0347 Ops/s | 6.4511 Ops/s | |
test_serialize_model_filesystem | 0.1552s | 0.1442s | 6.9351 Ops/s | 6.7147 Ops/s | |
test_reshape_pytree | 60.5730μs | 26.3182μs | 37.9965 KOps/s | 37.7020 KOps/s | |
test_reshape_td | 78.2950μs | 32.1377μs | 31.1161 KOps/s | 30.3092 KOps/s | |
test_view_pytree | 58.5890μs | 26.1650μs | 38.2190 KOps/s | 37.3686 KOps/s | |
test_view_td | 89.4770μs | 36.5136μs | 27.3871 KOps/s | 26.2341 KOps/s | |
test_unbind_pytree | 87.0320μs | 29.8989μs | 33.4461 KOps/s | 33.5391 KOps/s | |
test_unbind_td | 0.3852ms | 38.1282μs | 26.2273 KOps/s | 26.1764 KOps/s | |
test_split_pytree | 82.6160μs | 29.1199μs | 34.3408 KOps/s | 34.0736 KOps/s | |
test_split_td | 0.2059ms | 42.9501μs | 23.2829 KOps/s | 22.8723 KOps/s | |
test_add_pytree | 79.7690μs | 35.4336μs | 28.2218 KOps/s | 28.2234 KOps/s | |
test_add_td | 0.1335ms | 55.8890μs | 17.8926 KOps/s | 19.5790 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1413ms | 61.3440μs | 16.3015 KOps/s | 16.1544 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3538ms | 0.1604ms | 6.2347 KOps/s | 6.2130 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1127ms | 45.2792μs | 22.0852 KOps/s | 22.0988 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2426ms | 0.1185ms | 8.4357 KOps/s | 8.3438 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 86.5520μs | 26.0546μs | 38.3810 KOps/s | 38.7968 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1319ms | 51.8233μs | 19.2963 KOps/s | 18.3065 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1793ms | 76.5367μs | 13.0656 KOps/s | 12.5359 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1326ms | 65.8795μs | 15.1792 KOps/s | 14.4202 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1778ms | 0.1035ms | 9.6577 KOps/s | 9.6261 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4123ms | 0.2035ms | 4.9141 KOps/s | 5.0838 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1047ms | 43.7201μs | 22.8728 KOps/s | 22.8524 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4973ms | 62.2330μs | 16.0686 KOps/s | 16.4488 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1861ms | 0.1052ms | 9.5012 KOps/s | 9.9428 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3874ms | 0.2013ms | 4.9682 KOps/s | 4.8880 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4237ms | 0.2128ms | 4.6998 KOps/s | 4.7581 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2489ms | 0.1048ms | 9.5465 KOps/s | 9.6542 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1250ms | 54.3471μs | 18.4002 KOps/s | 18.7853 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 95.0580μs | 45.4906μs | 21.9825 KOps/s | 22.0079 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6027ms | 0.1600ms | 6.2499 KOps/s | 6.2028 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2002ms | 0.1029ms | 9.7144 KOps/s | 9.8346 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 64.8410μs | 21.2628μs | 47.0304 KOps/s | 48.1316 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1271ms | 58.1761μs | 17.1892 KOps/s | 16.5246 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1618ms | 80.2892μs | 12.4550 KOps/s | 12.3034 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1458ms | 68.7332μs | 14.5490 KOps/s | 14.1638 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3115ms | 0.2091ms | 4.7829 KOps/s | 4.9536 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.4729ms | 1.2738ms | 785.0589 Ops/s | 790.7157 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3857ms | 0.2043ms | 4.8948 KOps/s | 5.0448 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9289ms | 0.7728ms | 1.2941 KOps/s | 1.3054 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.6155ms | 0.4528ms | 2.2086 KOps/s | 2.2075 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.7657ms | 2.6528ms | 376.9601 Ops/s | 409.4325 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 90.4880μs | 34.9334μs | 28.6259 KOps/s | 28.8112 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4919ms | 33.2559μs | 30.0699 KOps/s | 31.6532 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 80.0900μs | 28.9171μs | 34.5816 KOps/s | 34.8967 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 70.0000μs | 23.4024μs | 42.7307 KOps/s | 42.8303 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 78.2960μs | 29.7591μs | 33.6031 KOps/s | 33.2838 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 98.4030μs | 23.4975μs | 42.5577 KOps/s | 42.7414 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 94.5560μs | 49.8975μs | 20.0411 KOps/s | 19.6790 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5086ms | 19.2547μs | 51.9353 KOps/s | 50.4218 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 91.1100μs | 42.9004μs | 23.3098 KOps/s | 22.4385 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 63.1880μs | 18.4328μs | 54.2511 KOps/s | 52.0288 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1086ms | 43.9466μs | 22.7549 KOps/s | 22.1690 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.0170μs | 18.3939μs | 54.3657 KOps/s | 51.5644 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1182ms | 51.0917μs | 19.5726 KOps/s | 19.2716 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0338ms | 19.2690μs | 51.8969 KOps/s | 50.3346 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1772ms | 44.5571μs | 22.4431 KOps/s | 22.1694 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 69.4600μs | 18.2895μs | 54.6763 KOps/s | 52.4223 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1024ms | 45.0256μs | 22.2096 KOps/s | 22.4714 KOps/s | |
test_compile_indexing[int-pytree-eager] | 50.6250μs | 18.0372μs | 55.4408 KOps/s | 52.7943 KOps/s | |
test_mod_add[eager] | 75.0100μs | 34.3369μs | 29.1232 KOps/s | 30.7822 KOps/s | |
test_mod_add[compile] | 89.6270μs | 47.5171μs | 21.0450 KOps/s | 21.3551 KOps/s | |
test_mod_add[compile-overhead] | 0.1272ms | 47.1411μs | 21.2129 KOps/s | 21.3232 KOps/s | |
test_mod_wrap[eager] | 0.3610ms | 0.2222ms | 4.5012 KOps/s | 4.6039 KOps/s | |
test_mod_wrap[compile] | 0.6947ms | 0.2109ms | 4.7420 KOps/s | 4.9249 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3020ms | 0.2039ms | 4.9033 KOps/s | 4.9888 KOps/s | |
test_mod_wrap_and_backward[eager] | 15.7557ms | 11.6828ms | 85.5959 Ops/s | 94.2527 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.9781ms | 12.5261ms | 79.8335 Ops/s | 74.5663 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 18.4507ms | 11.3747ms | 87.9147 Ops/s | 77.5154 Ops/s | |
test_seq_add[eager] | 0.2586ms | 0.1114ms | 8.9768 KOps/s | 9.2359 KOps/s | |
test_seq_add[compile] | 0.1461ms | 62.6942μs | 15.9504 KOps/s | 16.8221 KOps/s | |
test_seq_add[compile-overhead] | 0.1382ms | 60.3633μs | 16.5664 KOps/s | 17.3029 KOps/s | |
test_seq_wrap[eager] | 0.6173ms | 0.4441ms | 2.2516 KOps/s | 2.3518 KOps/s | |
test_seq_wrap[compile] | 0.4069ms | 0.2250ms | 4.4440 KOps/s | 4.3953 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4344ms | 0.2242ms | 4.4612 KOps/s | 4.4185 KOps/s | |
test_func_call_runtime[False-eager] | 0.8441ms | 0.5530ms | 1.8082 KOps/s | 1.8799 KOps/s | |
test_func_call_runtime[False-compile] | 0.7867ms | 0.4260ms | 2.3472 KOps/s | 2.3793 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7327ms | 0.4266ms | 2.3442 KOps/s | 2.3773 KOps/s | |
test_func_call_runtime[True-eager] | 1.2342ms | 0.7712ms | 1.2966 KOps/s | 1.3501 KOps/s | |
test_func_call_runtime[True-compile] | 0.8553ms | 0.4644ms | 2.1532 KOps/s | 2.1910 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7829ms | 0.4634ms | 2.1580 KOps/s | 2.1774 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9664ms | 0.5482ms | 1.8241 KOps/s | 1.9000 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6743ms | 0.4265ms | 2.3444 KOps/s | 2.3739 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6031ms | 0.4232ms | 2.3631 KOps/s | 2.3828 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0198ms | 0.8990ms | 1.1124 KOps/s | 1.1430 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7435ms | 0.4857ms | 2.0587 KOps/s | 2.0672 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6396ms | 0.4869ms | 2.0540 KOps/s | 2.0570 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4709ms | 1.8671ms | 535.5815 Ops/s | 534.4212 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0066ms | 0.5121ms | 1.9526 KOps/s | 1.9475 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.6993ms | 0.5090ms | 1.9647 KOps/s | 1.9352 KOps/s | |
test_distributed | 0.2502ms | 0.1259ms | 7.9439 KOps/s | 7.8636 KOps/s | |
test_tdmodule | 44.2930μs | 25.4773μs | 39.2507 KOps/s | 40.9623 KOps/s | |
test_tdmodule_dispatch | 68.2270μs | 47.1190μs | 21.2228 KOps/s | 22.2375 KOps/s | |
test_tdseq | 45.0650μs | 25.5288μs | 39.1714 KOps/s | 39.8798 KOps/s | |
test_tdseq_dispatch | 71.2530μs | 50.4563μs | 19.8191 KOps/s | 21.1636 KOps/s | |
test_instantiation_functorch | 2.5837ms | 1.5423ms | 648.3636 Ops/s | 651.2866 Ops/s | |
test_exec_functorch | 0.4440ms | 0.1818ms | 5.5006 KOps/s | 5.6647 KOps/s | |
test_exec_functional_call | 0.2913ms | 0.1720ms | 5.8155 KOps/s | 5.8082 KOps/s | |
test_exec_td_decorator | 0.8032ms | 0.2309ms | 4.3308 KOps/s | 4.4008 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9037ms | 0.6432ms | 1.5548 KOps/s | 1.5344 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1195ms | 0.6465ms | 1.5468 KOps/s | 1.5495 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.9020ms | 0.5204ms | 1.9214 KOps/s | 1.9036 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7543ms | 0.5214ms | 1.9181 KOps/s | 1.9089 KOps/s | |
test_to_module_speed[True] | 1.6342ms | 1.2780ms | 782.4980 Ops/s | 773.5001 Ops/s | |
test_to_module_speed[False] | 1.8639ms | 1.2505ms | 799.6731 Ops/s | 797.9767 Ops/s | |
test_tc_init | 83.1250μs | 44.9311μs | 22.2563 KOps/s | 22.5224 KOps/s | |
test_tc_init_nested | 0.1624ms | 89.7266μs | 11.1450 KOps/s | 11.2434 KOps/s | |
test_tc_first_layer_tensor | 16.2700μs | 1.5059μs | 664.0336 KOps/s | 665.3972 KOps/s | |
test_tc_first_layer_nontensor | 43.1680μs | 4.6738μs | 213.9590 KOps/s | 209.4003 KOps/s | |
test_tc_second_layer_tensor | 28.8260μs | 2.7465μs | 364.0979 KOps/s | 356.0032 KOps/s | |
test_tc_second_layer_nontensor | 25.5280μs | 5.9156μs | 169.0443 KOps/s | 161.9859 KOps/s | |
test_unbind | 0.2219s | 12.5637ms | 79.5942 Ops/s | 80.2995 Ops/s | |
test_full_like | 8.9333ms | 7.6589ms | 130.5664 Ops/s | 129.6677 Ops/s | |
test_zeros_like | 3.5093ms | 2.9387ms | 340.2870 Ops/s | 316.8009 Ops/s | |
test_ones_like | 4.0139ms | 3.3555ms | 298.0215 Ops/s | 295.7131 Ops/s | |
test_clone | 6.3453ms | 5.5272ms | 180.9238 Ops/s | 171.6295 Ops/s | |
test_squeeze | 58.2280μs | 11.9024μs | 84.0167 KOps/s | 85.7733 KOps/s | |
test_unsqueeze | 0.2926ms | 88.0655μs | 11.3552 KOps/s | 11.3051 KOps/s | |
test_split | 0.3124ms | 0.1887ms | 5.2983 KOps/s | 5.1691 KOps/s | |
test_permute | 0.3445ms | 0.2146ms | 4.6603 KOps/s | 4.4971 KOps/s | |
test_stack | 31.2211ms | 27.5893ms | 36.2460 Ops/s | 36.4065 Ops/s | |
test_cat | 32.0271ms | 27.5466ms | 36.3021 Ops/s | 37.8247 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.3510μs | 9.9357μs | 100.6476 KOps/s | 92.3390 KOps/s | |
test_plain_set_stack_nested | 32.2500μs | 9.9517μs | 100.4850 KOps/s | 92.7040 KOps/s | |
test_plain_set_nested_inplace | 35.3600μs | 10.9052μs | 91.6997 KOps/s | 85.4620 KOps/s | |
test_plain_set_stack_nested_inplace | 38.5010μs | 10.8359μs | 92.2857 KOps/s | 86.4009 KOps/s | |
test_items | 28.9900μs | 2.8977μs | 345.1036 KOps/s | 340.2388 KOps/s | |
test_items_nested | 0.4131ms | 0.3578ms | 2.7951 KOps/s | 2.8347 KOps/s | |
test_items_nested_locked | 0.4421ms | 0.3589ms | 2.7862 KOps/s | 2.8012 KOps/s | |
test_items_nested_leaf | 0.2225ms | 58.1874μs | 17.1859 KOps/s | 17.2315 KOps/s | |
test_items_stack_nested | 0.3879ms | 0.3577ms | 2.7956 KOps/s | 2.8084 KOps/s | |
test_items_stack_nested_leaf | 0.2406ms | 59.3634μs | 16.8454 KOps/s | 17.1401 KOps/s | |
test_items_stack_nested_locked | 0.5282ms | 0.3621ms | 2.7620 KOps/s | 2.7883 KOps/s | |
test_keys | 0.1855ms | 3.4931μs | 286.2802 KOps/s | 290.1901 KOps/s | |
test_keys_nested | 0.1790ms | 70.4208μs | 14.2003 KOps/s | 14.3998 KOps/s | |
test_keys_nested_locked | 0.7846ms | 75.5123μs | 13.2429 KOps/s | 13.2308 KOps/s | |
test_keys_nested_leaf | 95.4010μs | 61.7106μs | 16.2047 KOps/s | 16.2991 KOps/s | |
test_keys_stack_nested | 0.1079ms | 71.0212μs | 14.0803 KOps/s | 14.1966 KOps/s | |
test_keys_stack_nested_leaf | 88.5610μs | 61.6142μs | 16.2300 KOps/s | 16.3191 KOps/s | |
test_keys_stack_nested_locked | 0.1258ms | 76.2583μs | 13.1133 KOps/s | 13.2059 KOps/s | |
test_values | 4.7083μs | 0.8540μs | 1.1710 MOps/s | 1.1781 MOps/s | |
test_values_nested | 72.5810μs | 31.2869μs | 31.9622 KOps/s | 32.0912 KOps/s | |
test_values_nested_locked | 60.0810μs | 33.8407μs | 29.5502 KOps/s | 30.5312 KOps/s | |
test_values_nested_leaf | 0.1557ms | 33.3287μs | 30.0042 KOps/s | 29.8605 KOps/s | |
test_values_stack_nested | 63.5510μs | 31.8468μs | 31.4003 KOps/s | 31.6706 KOps/s | |
test_values_stack_nested_leaf | 60.0810μs | 34.1037μs | 29.3223 KOps/s | 29.7010 KOps/s | |
test_values_stack_nested_locked | 63.1010μs | 34.0368μs | 29.3800 KOps/s | 30.2765 KOps/s | |
test_membership | 1.6325μs | 0.5071μs | 1.9720 MOps/s | 1.9555 MOps/s | |
test_membership_nested | 13.9400μs | 1.9930μs | 501.7553 KOps/s | 498.7240 KOps/s | |
test_membership_nested_leaf | 17.6055μs | 2.0358μs | 491.2117 KOps/s | 492.9482 KOps/s | |
test_membership_stacked_nested | 33.0410μs | 2.0992μs | 476.3685 KOps/s | 485.6215 KOps/s | |
test_membership_stacked_nested_leaf | 23.7200μs | 2.0879μs | 478.9611 KOps/s | 481.8746 KOps/s | |
test_membership_nested_last | 33.4110μs | 2.9549μs | 338.4218 KOps/s | 339.0604 KOps/s | |
test_membership_nested_leaf_last | 0.1583ms | 2.9145μs | 343.1118 KOps/s | 341.1922 KOps/s | |
test_membership_stacked_nested_last | 80.6310μs | 2.9241μs | 341.9890 KOps/s | 336.5874 KOps/s | |
test_membership_stacked_nested_leaf_last | 38.5600μs | 2.9230μs | 342.1176 KOps/s | 339.4754 KOps/s | |
test_nested_getleaf | 28.7500μs | 6.1448μs | 162.7404 KOps/s | 162.4165 KOps/s | |
test_nested_get | 29.7400μs | 5.8266μs | 171.6263 KOps/s | 171.1119 KOps/s | |
test_stacked_getleaf | 35.0000μs | 6.1295μs | 163.1461 KOps/s | 162.4668 KOps/s | |
test_stacked_get | 33.5700μs | 5.8697μs | 170.3668 KOps/s | 172.5168 KOps/s | |
test_nested_getitemleaf | 34.0100μs | 6.2346μs | 160.3958 KOps/s | 160.7467 KOps/s | |
test_nested_getitem | 34.5700μs | 5.9395μs | 168.3635 KOps/s | 168.6220 KOps/s | |
test_stacked_getitemleaf | 34.5710μs | 6.2002μs | 161.2864 KOps/s | 161.0827 KOps/s | |
test_stacked_getitem | 29.3000μs | 5.9389μs | 168.3816 KOps/s | 168.8606 KOps/s | |
test_lock_nested | 9.6404ms | 0.3825ms | 2.6144 KOps/s | 2.7177 KOps/s | |
test_lock_stack_nested | 0.4591ms | 0.3420ms | 2.9236 KOps/s | 2.9926 KOps/s | |
test_unlock_nested | 0.6509ms | 0.3128ms | 3.1969 KOps/s | 3.3110 KOps/s | |
test_unlock_stack_nested | 0.3156ms | 0.2806ms | 3.5639 KOps/s | 3.6798 KOps/s | |
test_flatten_speed | 0.1789ms | 74.8866μs | 13.3535 KOps/s | 13.2200 KOps/s | |
test_unflatten_speed | 0.3487ms | 0.3098ms | 3.2276 KOps/s | 3.2951 KOps/s | |
test_common_ops | 1.6314ms | 0.5787ms | 1.7281 KOps/s | 1.6706 KOps/s | |
test_creation | 78.2310μs | 1.4902μs | 671.0701 KOps/s | 689.2597 KOps/s | |
test_creation_empty | 30.3910μs | 6.0088μs | 166.4232 KOps/s | 128.6397 KOps/s | |
test_creation_nested_1 | 34.7600μs | 7.6306μs | 131.0508 KOps/s | 107.7846 KOps/s | |
test_creation_nested_2 | 34.3310μs | 10.1901μs | 98.1344 KOps/s | 84.2769 KOps/s | |
test_clone | 0.1233ms | 11.5017μs | 86.9434 KOps/s | 96.3121 KOps/s | |
test_getitem[int] | 1.5979ms | 10.9437μs | 91.3769 KOps/s | 94.9589 KOps/s | |
test_getitem[slice_int] | 93.3793ms | 30.5357μs | 32.7485 KOps/s | 48.1302 KOps/s | |
test_getitem[range] | 0.1882ms | 38.7750μs | 25.7898 KOps/s | 26.6128 KOps/s | |
test_getitem[tuple] | 0.1124ms | 19.0837μs | 52.4006 KOps/s | 55.1520 KOps/s | |
test_getitem[list] | 0.1973ms | 34.6528μs | 28.8577 KOps/s | 30.4330 KOps/s | |
test_setitem_dim[int] | 0.1293ms | 20.2406μs | 49.4057 KOps/s | 54.0073 KOps/s | |
test_setitem_dim[slice_int] | 97.2710μs | 39.6492μs | 25.2212 KOps/s | 26.2745 KOps/s | |
test_setitem_dim[range] | 0.1628ms | 54.8799μs | 18.2216 KOps/s | 18.9993 KOps/s | |
test_setitem_dim[tuple] | 90.3510μs | 33.6151μs | 29.7485 KOps/s | 31.1464 KOps/s | |
test_setitem | 76.9510μs | 15.0856μs | 66.2886 KOps/s | 66.6526 KOps/s | |
test_set | 86.6810μs | 14.3895μs | 69.4950 KOps/s | 69.8493 KOps/s | |
test_set_shared | 1.6198ms | 0.1481ms | 6.7524 KOps/s | 6.7704 KOps/s | |
test_update | 0.3093ms | 16.2501μs | 61.5382 KOps/s | 57.9157 KOps/s | |
test_update_nested | 85.2410μs | 21.5123μs | 46.4850 KOps/s | 45.8645 KOps/s | |
test_update__nested | 0.5295ms | 26.0403μs | 38.4020 KOps/s | 40.9848 KOps/s | |
test_set_nested | 76.2810μs | 15.8286μs | 63.1767 KOps/s | 64.1168 KOps/s | |
test_set_nested_new | 0.1941ms | 18.0566μs | 55.3813 KOps/s | 55.1027 KOps/s | |
test_select | 0.2218ms | 28.8441μs | 34.6691 KOps/s | 33.1489 KOps/s | |
test_select_nested | 0.2245ms | 41.8267μs | 23.9082 KOps/s | 24.3637 KOps/s | |
test_exclude_nested | 0.2537ms | 62.4361μs | 16.0164 KOps/s | 16.4189 KOps/s | |
test_empty[True] | 0.4620ms | 0.2762ms | 3.6205 KOps/s | 3.6261 KOps/s | |
test_empty[False] | 20.0593μs | 0.7421μs | 1.3475 MOps/s | 1.3453 MOps/s | |
test_to | 87.4710μs | 56.2907μs | 17.7649 KOps/s | 15.9360 KOps/s | |
test_to_nonblocking | 0.2161ms | 47.9837μs | 20.8404 KOps/s | 21.5588 KOps/s | |
test_unbind_speed | 0.3298ms | 0.2367ms | 4.2249 KOps/s | 4.3929 KOps/s | |
test_unbind_speed_stack0 | 0.4129ms | 0.2361ms | 4.2348 KOps/s | 4.3819 KOps/s | |
test_unbind_speed_stack1 | 95.1182ms | 0.6578ms | 1.5202 KOps/s | 1.5523 KOps/s | |
test_split | 93.3774ms | 1.6297ms | 613.6267 Ops/s | 641.3184 Ops/s | |
test_chunk | 96.3623ms | 1.6415ms | 609.1941 Ops/s | 588.1018 Ops/s | |
test_consolidate[False-None] | 96.0018ms | 2.8700ms | 348.4279 Ops/s | 379.7752 Ops/s | |
test_consolidate[default-None] | 1.8802ms | 1.7122ms | 584.0436 Ops/s | 595.4387 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.9046ms | 1.7511ms | 571.0728 Ops/s | 579.8443 Ops/s | |
test_consolidate_njt[False-None] | 6.9470ms | 6.6726ms | 149.8661 Ops/s | 153.2501 Ops/s | |
test_to[False-False-None] | 1.9328ms | 1.7640ms | 566.8854 Ops/s | 585.1548 Ops/s | |
test_to[True-False-None] | 1.6291ms | 1.3631ms | 733.6110 Ops/s | 755.1963 Ops/s | |
test_to[within-False-None] | 4.4668ms | 4.1340ms | 241.8991 Ops/s | 245.5670 Ops/s | |
test_to[True-default-None] | 6.0380ms | 5.4006ms | 185.1633 Ops/s | 177.4829 Ops/s | |
test_to_njt[False-False-None] | 7.3587ms | 7.0436ms | 141.9724 Ops/s | 136.7390 Ops/s | |
test_to_njt[True-False-None] | 5.8612ms | 5.5694ms | 179.5519 Ops/s | 179.9506 Ops/s | |
test_to_njt[within-False-None] | 13.3366ms | 13.0496ms | 76.6309 Ops/s | 81.6099 Ops/s | |
test_creation[device0] | 0.5312ms | 83.2703μs | 12.0091 KOps/s | 12.3222 KOps/s | |
test_creation_from_tensor | 0.6001ms | 86.8578μs | 11.5131 KOps/s | 12.0431 KOps/s | |
test_add_one[memmap_tensor0] | 0.4332ms | 7.3347μs | 136.3379 KOps/s | 146.0011 KOps/s | |
test_contiguous[memmap_tensor0] | 2.0255μs | 0.4146μs | 2.4120 MOps/s | 2.4296 MOps/s | |
test_stack[memmap_tensor0] | 0.1519ms | 4.7768μs | 209.3468 KOps/s | 223.5894 KOps/s | |
test_memmaptd_index | 1.6332ms | 0.2573ms | 3.8869 KOps/s | 3.9526 KOps/s | |
test_memmaptd_index_astensor | 0.8041ms | 0.3130ms | 3.1951 KOps/s | 3.2183 KOps/s | |
test_memmaptd_index_op | 1.0412ms | 0.5799ms | 1.7243 KOps/s | 1.6813 KOps/s | |
test_serialize_model | 0.1313s | 0.1306s | 7.6590 Ops/s | 7.6983 Ops/s | |
test_serialize_model_pickle | 1.3500s | 1.2118s | 0.8252 Ops/s | 0.8410 Ops/s | |
test_serialize_weights | 0.1313s | 0.1299s | 7.7012 Ops/s | 7.6912 Ops/s | |
test_serialize_weights_returnearly | 0.4676s | 69.8374ms | 14.3190 Ops/s | 14.0725 Ops/s | |
test_serialize_weights_pickle | 1.3760s | 1.2215s | 0.8187 Ops/s | 0.8207 Ops/s | |
test_reshape_pytree | 0.1325ms | 22.6313μs | 44.1866 KOps/s | 44.1087 KOps/s | |
test_reshape_td | 54.0800μs | 27.0730μs | 36.9372 KOps/s | 37.2632 KOps/s | |
test_view_pytree | 0.1695ms | 22.4896μs | 44.4650 KOps/s | 44.5666 KOps/s | |
test_view_td | 0.1319ms | 30.4743μs | 32.8146 KOps/s | 32.7224 KOps/s | |
test_unbind_pytree | 0.1176ms | 28.3572μs | 35.2645 KOps/s | 35.2949 KOps/s | |
test_unbind_td | 0.8115ms | 36.2245μs | 27.6057 KOps/s | 27.1138 KOps/s | |
test_split_pytree | 0.1711ms | 30.5260μs | 32.7589 KOps/s | 32.8573 KOps/s | |
test_split_td | 1.0086ms | 40.2191μs | 24.8638 KOps/s | 25.9432 KOps/s | |
test_add_pytree | 0.1486ms | 35.9928μs | 27.7833 KOps/s | 28.7524 KOps/s | |
test_add_td | 0.1363ms | 46.6569μs | 21.4331 KOps/s | 21.0855 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2714ms | 0.1227ms | 8.1477 KOps/s | 7.9790 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2761ms | 0.1249ms | 8.0044 KOps/s | 8.0364 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2463ms | 97.1054μs | 10.2981 KOps/s | 10.0614 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3021ms | 0.1551ms | 6.4486 KOps/s | 6.6432 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1728ms | 23.7380μs | 42.1265 KOps/s | 45.6348 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1469ms | 27.2133μs | 36.7467 KOps/s | 37.0797 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4069ms | 64.9340μs | 15.4002 KOps/s | 15.3493 KOps/s | |
test_compile_copy_nested[pytree-eager] | 99.0020μs | 49.3752μs | 20.2531 KOps/s | 20.1735 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2996ms | 0.1438ms | 6.9533 KOps/s | 6.8993 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4136ms | 0.2113ms | 4.7317 KOps/s | 4.7990 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2459ms | 99.7705μs | 10.0230 KOps/s | 10.1241 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2164ms | 54.2096μs | 18.4469 KOps/s | 18.7563 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2774ms | 0.1375ms | 7.2713 KOps/s | 6.9900 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6610ms | 0.4998ms | 2.0009 KOps/s | 1.9252 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3983ms | 0.2504ms | 3.9944 KOps/s | 4.0148 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2890ms | 0.1458ms | 6.8585 KOps/s | 6.7291 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2241ms | 65.4544μs | 15.2778 KOps/s | 15.7814 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2859ms | 0.1058ms | 9.4544 KOps/s | 9.7288 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5821ms | 0.4204ms | 2.3789 KOps/s | 2.4204 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3317ms | 0.1439ms | 6.9508 KOps/s | 7.3461 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2073ms | 23.0498μs | 43.3844 KOps/s | 52.5312 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 52.8610μs | 26.6543μs | 37.5173 KOps/s | 37.1144 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1155ms | 69.9801μs | 14.2898 KOps/s | 14.3461 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1602ms | 51.5460μs | 19.4002 KOps/s | 19.3244 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6252ms | 0.3930ms | 2.5448 KOps/s | 2.0202 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.0373ms | 2.7189ms | 367.8008 Ops/s | 380.7693 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6038ms | 0.4359ms | 2.2939 KOps/s | 2.2049 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.1994ms | 2.8375ms | 352.4265 Ops/s | 373.9704 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3487ms | 0.1227ms | 8.1513 KOps/s | 8.2499 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5680ms | 83.7381μs | 11.9420 KOps/s | 11.8900 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5303ms | 0.1132ms | 8.8347 KOps/s | 8.8832 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2493ms | 73.9601μs | 13.5208 KOps/s | 13.7213 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2844ms | 0.1128ms | 8.8614 KOps/s | 8.8051 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2393ms | 71.1557μs | 14.0537 KOps/s | 13.8153 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2523ms | 0.1033ms | 9.6798 KOps/s | 9.3712 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1651ms | 18.6784μs | 53.5379 KOps/s | 56.4221 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2473ms | 98.7546μs | 10.1261 KOps/s | 10.0901 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1480ms | 16.6734μs | 59.9759 KOps/s | 61.5076 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2559ms | 98.9866μs | 10.1024 KOps/s | 9.6738 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1461ms | 16.6022μs | 60.2331 KOps/s | 62.6721 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.3008ms | 0.1082ms | 9.2442 KOps/s | 9.4827 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6451ms | 18.1102μs | 55.2175 KOps/s | 57.3037 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2622ms | 0.1019ms | 9.8153 KOps/s | 9.7036 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.2559ms | 17.2026μs | 58.1308 KOps/s | 62.8080 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2502ms | 98.9855μs | 10.1025 KOps/s | 9.8276 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1606ms | 16.9901μs | 58.8579 KOps/s | 63.1049 KOps/s | |
test_mod_add[eager] | 0.1871ms | 37.3469μs | 26.7760 KOps/s | 26.6565 KOps/s | |
test_mod_add[compile] | 0.3115ms | 82.5076μs | 12.1201 KOps/s | 12.1346 KOps/s | |
test_mod_add[compile-overhead] | 0.3211ms | 0.1668ms | 5.9964 KOps/s | 5.6405 KOps/s | |
test_mod_wrap[eager] | 0.3993ms | 0.2534ms | 3.9461 KOps/s | 3.9124 KOps/s | |
test_mod_wrap[compile] | 0.4418ms | 0.2895ms | 3.4547 KOps/s | 3.4350 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5301ms | 3.8304ms | 261.0681 Ops/s | 262.3746 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5950ms | 1.3933ms | 717.7305 Ops/s | 675.6480 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.6460ms | 1.2953ms | 772.0297 Ops/s | 706.0070 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4025ms | 0.9375ms | 1.0667 KOps/s | 925.3949 Ops/s | |
test_seq_add[eager] | 0.2875ms | 0.1155ms | 8.6545 KOps/s | 8.6897 KOps/s | |
test_seq_add[compile] | 0.2380ms | 90.9785μs | 10.9916 KOps/s | 10.5024 KOps/s | |
test_seq_add[compile-overhead] | 0.3108ms | 0.1295ms | 7.7191 KOps/s | 7.6346 KOps/s | |
test_seq_wrap[eager] | 0.6275ms | 0.4396ms | 2.2747 KOps/s | 2.2280 KOps/s | |
test_seq_wrap[compile] | 0.5040ms | 0.3076ms | 3.2511 KOps/s | 3.2577 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3805ms | 0.2252ms | 4.4414 KOps/s | 4.4483 KOps/s | |
test_func_call_runtime[False-eager] | 0.9150ms | 0.7558ms | 1.3231 KOps/s | 1.3135 KOps/s | |
test_func_call_runtime[False-compile] | 0.9318ms | 0.7603ms | 1.3154 KOps/s | 1.3176 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5008ms | 0.3647ms | 2.7417 KOps/s | 2.7543 KOps/s | |
test_func_call_runtime[True-eager] | 1.0712ms | 0.9220ms | 1.0846 KOps/s | 1.0883 KOps/s | |
test_func_call_runtime[True-compile] | 0.9345ms | 0.7813ms | 1.2799 KOps/s | 1.2815 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5334ms | 0.3854ms | 2.5945 KOps/s | 2.5959 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9451ms | 0.7573ms | 1.3204 KOps/s | 1.3105 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9173ms | 0.7593ms | 1.3170 KOps/s | 1.3105 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5109ms | 0.3661ms | 2.7314 KOps/s | 2.7276 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1849ms | 1.0168ms | 983.4658 Ops/s | 977.7996 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9795ms | 0.8119ms | 1.2317 KOps/s | 1.2350 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5724ms | 0.4124ms | 2.4248 KOps/s | 2.4176 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6368ms | 2.1191ms | 471.8971 Ops/s | 471.5125 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.2258ms | 0.8221ms | 1.2164 KOps/s | 1.2169 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5796ms | 0.4137ms | 2.4171 KOps/s | 2.4213 KOps/s | |
test_distributed | 5.4634ms | 0.3414ms | 2.9294 KOps/s | 8.7765 KOps/s | |
test_tdmodule | 0.1240ms | 18.2744μs | 54.7215 KOps/s | 51.3984 KOps/s | |
test_tdmodule_dispatch | 0.1292ms | 32.0881μs | 31.1642 KOps/s | 29.2498 KOps/s | |
test_tdseq | 0.1975ms | 17.8411μs | 56.0503 KOps/s | 52.8395 KOps/s | |
test_tdseq_dispatch | 53.9710μs | 33.7954μs | 29.5899 KOps/s | 27.3646 KOps/s | |
test_instantiation_functorch | 1.8311ms | 1.5689ms | 637.4036 Ops/s | 640.2591 Ops/s | |
test_exec_functorch | 0.3442ms | 0.1486ms | 6.7292 KOps/s | 6.9330 KOps/s | |
test_exec_functional_call | 0.3502ms | 0.1435ms | 6.9662 KOps/s | 7.1333 KOps/s | |
test_exec_td_decorator | 0.3924ms | 0.1909ms | 5.2372 KOps/s | 5.3581 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8675ms | 0.6858ms | 1.4581 KOps/s | 1.4484 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9316ms | 0.6972ms | 1.4344 KOps/s | 1.4292 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8004ms | 0.6141ms | 1.6285 KOps/s | 1.6559 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7523ms | 0.6018ms | 1.6616 KOps/s | 1.6563 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.0857ms | 19.5249ms | 51.2167 Ops/s | 51.3648 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.3281ms | 19.6504ms | 50.8895 Ops/s | 51.3163 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 20.1927ms | 19.4428ms | 51.4329 Ops/s | 51.7033 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.3224ms | 19.4788ms | 51.3378 Ops/s | 51.7244 Ops/s | |
test_to_module_speed[True] | 1.1055ms | 0.9370ms | 1.0672 KOps/s | 1.0571 KOps/s | |
test_to_module_speed[False] | 1.3737ms | 0.9271ms | 1.0787 KOps/s | 1.0888 KOps/s | |
test_tc_init | 92.3110μs | 33.6056μs | 29.7570 KOps/s | 27.6996 KOps/s | |
test_tc_init_nested | 0.1253ms | 66.8806μs | 14.9520 KOps/s | 13.6527 KOps/s | |
test_tc_first_layer_tensor | 10.8701μs | 0.6787μs | 1.4733 MOps/s | 1.4177 MOps/s | |
test_tc_first_layer_nontensor | 24.0400μs | 2.2733μs | 439.8974 KOps/s | 425.0373 KOps/s | |
test_tc_second_layer_tensor | 7.4378μs | 1.4105μs | 708.9494 KOps/s | 694.9952 KOps/s | |
test_tc_second_layer_nontensor | 24.5600μs | 3.0618μs | 326.6082 KOps/s | 328.0017 KOps/s | |
test_unbind | 0.2424s | 10.0200ms | 99.8006 Ops/s | 150.0311 Ops/s | |
test_full_like | 12.4830ms | 9.6874ms | 103.2267 Ops/s | 105.4291 Ops/s | |
test_zeros_like | 6.0053ms | 4.3424ms | 230.2848 Ops/s | 232.9390 Ops/s | |
test_ones_like | 9.5164ms | 7.2944ms | 137.0914 Ops/s | 235.8939 Ops/s | |
test_clone | 12.2790ms | 9.2723ms | 107.8477 Ops/s | 152.3798 Ops/s | |
test_squeeze | 61.2910μs | 9.4786μs | 105.5013 KOps/s | 109.4805 KOps/s | |
test_unsqueeze | 0.2162ms | 72.6871μs | 13.7576 KOps/s | 13.6228 KOps/s | |
test_split | 0.4004ms | 0.1629ms | 6.1400 KOps/s | 6.2696 KOps/s | |
test_permute | 0.3328ms | 0.1859ms | 5.3804 KOps/s | 5.5514 KOps/s | |
test_stack | 53.9124ms | 51.7321ms | 19.3303 Ops/s | 19.4467 Ops/s | |
test_cat | 53.6635ms | 51.3420ms | 19.4772 Ops/s | 19.5573 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):