[Feature] TensorDict.separates #1120
Merged
vmoens added a commit that referenced this pull request on Dec 2, 2024
ghstack-source-id: 0eb78cd649e28df666b41fb7d8766e14f7c6bfe8 Pull Request resolved: #1120
facebook-github-bot added the CLA Signed label on Dec 2, 2024 (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.)
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 54.8730μs | 18.2534μs | 54.7844 KOps/s | 53.4538 KOps/s | |
test_plain_set_stack_nested | 44.6840μs | 18.4487μs | 54.2045 KOps/s | 53.9841 KOps/s | |
test_plain_set_nested_inplace | 52.5090μs | 20.2055μs | 49.4915 KOps/s | 49.7396 KOps/s | |
test_plain_set_stack_nested_inplace | 57.0870μs | 20.0926μs | 49.7696 KOps/s | 49.3802 KOps/s | |
test_items | 32.6010μs | 4.0366μs | 247.7338 KOps/s | 240.0875 KOps/s | |
test_items_nested | 0.6958ms | 0.4035ms | 2.4781 KOps/s | 2.4751 KOps/s | |
test_items_nested_locked | 0.6421ms | 0.4018ms | 2.4890 KOps/s | 2.4774 KOps/s | |
test_items_nested_leaf | 0.1200ms | 71.5049μs | 13.9851 KOps/s | 14.1024 KOps/s | |
test_items_stack_nested | 0.4758ms | 0.4073ms | 2.4553 KOps/s | 2.4648 KOps/s | |
test_items_stack_nested_leaf | 0.1314ms | 74.4352μs | 13.4345 KOps/s | 13.2399 KOps/s | |
test_items_stack_nested_locked | 0.8710ms | 0.4045ms | 2.4719 KOps/s | 2.4991 KOps/s | |
test_keys | 20.1880μs | 3.5139μs | 284.5836 KOps/s | 283.4931 KOps/s | |
test_keys_nested | 0.2211ms | 0.1368ms | 7.3082 KOps/s | 7.0909 KOps/s | |
test_keys_nested_locked | 0.7053ms | 0.1427ms | 7.0071 KOps/s | 6.9528 KOps/s | |
test_keys_nested_leaf | 0.2010ms | 0.1186ms | 8.4320 KOps/s | 8.3591 KOps/s | |
test_keys_stack_nested | 0.2220ms | 0.1366ms | 7.3184 KOps/s | 7.2103 KOps/s | |
test_keys_stack_nested_leaf | 0.2001ms | 0.1175ms | 8.5074 KOps/s | 8.4359 KOps/s | |
test_keys_stack_nested_locked | 0.2663ms | 0.1434ms | 6.9745 KOps/s | 6.9141 KOps/s | |
test_values | 7.2014μs | 1.0522μs | 950.3586 KOps/s | 920.0374 KOps/s | |
test_values_nested | 0.1022ms | 56.1512μs | 17.8091 KOps/s | 18.2979 KOps/s | |
test_values_nested_locked | 0.1098ms | 56.5727μs | 17.6764 KOps/s | 18.3782 KOps/s | |
test_values_nested_leaf | 0.1080ms | 60.6712μs | 16.4823 KOps/s | 16.7485 KOps/s | |
test_values_stack_nested | 0.1036ms | 57.3260μs | 17.4441 KOps/s | 17.9595 KOps/s | |
test_values_stack_nested_leaf | 0.1084ms | 60.9115μs | 16.4173 KOps/s | 16.6672 KOps/s | |
test_values_stack_nested_locked | 0.1137ms | 57.0482μs | 17.5290 KOps/s | 17.9374 KOps/s | |
test_membership | 16.7920μs | 0.8989μs | 1.1124 MOps/s | 1.1228 MOps/s | |
test_membership_nested | 22.1920μs | 2.9378μs | 340.3936 KOps/s | 341.3718 KOps/s | |
test_membership_nested_leaf | 23.2130μs | 2.9655μs | 337.2141 KOps/s | 340.3297 KOps/s | |
test_membership_stacked_nested | 25.2670μs | 2.9695μs | 336.7525 KOps/s | 344.0093 KOps/s | |
test_membership_stacked_nested_leaf | 19.3870μs | 2.9181μs | 342.6880 KOps/s | 341.8451 KOps/s | |
test_membership_nested_last | 27.2010μs | 4.2379μs | 235.9634 KOps/s | 234.6322 KOps/s | |
test_membership_nested_leaf_last | 23.9550μs | 4.2303μs | 236.3905 KOps/s | 236.8551 KOps/s | |
test_membership_stacked_nested_last | 28.9840μs | 4.2266μs | 236.5945 KOps/s | 205.2153 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.1580μs | 4.2446μs | 235.5927 KOps/s | 205.9137 KOps/s | |
test_nested_getleaf | 39.4340μs | 11.0945μs | 90.1347 KOps/s | 92.9675 KOps/s | |
test_nested_get | 57.1690μs | 10.4648μs | 95.5580 KOps/s | 96.1941 KOps/s | |
test_stacked_getleaf | 37.6300μs | 10.9525μs | 91.3032 KOps/s | 91.5084 KOps/s | |
test_stacked_get | 37.6200μs | 10.4367μs | 95.8156 KOps/s | 98.9708 KOps/s | |
test_nested_getitemleaf | 31.4890μs | 11.5220μs | 86.7903 KOps/s | 88.6199 KOps/s | |
test_nested_getitem | 35.5760μs | 10.6559μs | 93.8447 KOps/s | 94.8312 KOps/s | |
test_stacked_getitemleaf | 46.3460μs | 11.4298μs | 87.4907 KOps/s | 89.2929 KOps/s | |
test_stacked_getitem | 36.2380μs | 10.5261μs | 95.0015 KOps/s | 95.6932 KOps/s | |
test_lock_nested | 0.9578ms | 0.4407ms | 2.2689 KOps/s | 2.2566 KOps/s | |
test_lock_stack_nested | 0.5081ms | 0.4127ms | 2.4230 KOps/s | 2.4409 KOps/s | |
test_unlock_nested | 0.7124ms | 0.3568ms | 2.8026 KOps/s | 2.7502 KOps/s | |
test_unlock_stack_nested | 0.7156ms | 0.3331ms | 3.0018 KOps/s | 3.0229 KOps/s | |
test_flatten_speed | 0.1877ms | 96.3639μs | 10.3773 KOps/s | 10.5936 KOps/s | |
test_unflatten_speed | 0.7136ms | 0.5014ms | 1.9944 KOps/s | 2.0194 KOps/s | |
test_common_ops | 4.8915ms | 0.7997ms | 1.2505 KOps/s | 1.2555 KOps/s | |
test_creation | 28.3030μs | 2.2826μs | 438.0910 KOps/s | 477.9695 KOps/s | |
test_creation_empty | 41.4880μs | 12.1974μs | 81.9847 KOps/s | 79.0993 KOps/s | |
test_creation_nested_1 | 72.7960μs | 15.0652μs | 66.3784 KOps/s | 64.5231 KOps/s | |
test_creation_nested_2 | 54.0510μs | 19.3344μs | 51.7212 KOps/s | 49.7255 KOps/s | |
test_clone | 1.4731ms | 13.3993μs | 74.6310 KOps/s | 74.5387 KOps/s | |
test_getitem[int] | 0.8650ms | 12.5395μs | 79.7482 KOps/s | 80.4626 KOps/s | |
test_getitem[slice_int] | 0.1367ms | 24.9826μs | 40.0278 KOps/s | 40.8468 KOps/s | |
test_getitem[range] | 0.1736ms | 49.6067μs | 20.1586 KOps/s | 20.2562 KOps/s | |
test_getitem[tuple] | 0.1315ms | 20.3610μs | 49.1136 KOps/s | 50.0318 KOps/s | |
test_getitem[list] | 0.1661ms | 45.2683μs | 22.0905 KOps/s | 22.7050 KOps/s | |
test_setitem_dim[int] | 59.1800μs | 25.8309μs | 38.7134 KOps/s | 39.9616 KOps/s | |
test_setitem_dim[slice_int] | 0.1083ms | 54.1535μs | 18.4660 KOps/s | 19.5224 KOps/s | |
test_setitem_dim[range] | 0.1146ms | 73.6952μs | 13.5694 KOps/s | 13.5708 KOps/s | |
test_setitem_dim[tuple] | 69.6910μs | 41.6073μs | 24.0342 KOps/s | 24.6045 KOps/s | |
test_setitem | 71.6740μs | 20.9895μs | 47.6428 KOps/s | 47.4012 KOps/s | |
test_set | 0.1140ms | 20.3896μs | 49.0447 KOps/s | 48.3433 KOps/s | |
test_set_shared | 1.1482ms | 0.1678ms | 5.9596 KOps/s | 6.0010 KOps/s | |
test_update | 0.1381ms | 23.2608μs | 42.9908 KOps/s | 41.9079 KOps/s | |
test_update_nested | 0.1544ms | 33.2612μs | 30.0650 KOps/s | 29.1453 KOps/s | |
test_update__nested | 0.4045ms | 31.9766μs | 31.2728 KOps/s | 31.0052 KOps/s | |
test_set_nested | 0.2230ms | 22.6279μs | 44.1933 KOps/s | 43.4991 KOps/s | |
test_set_nested_new | 0.2118ms | 27.4574μs | 36.4201 KOps/s | 36.6719 KOps/s | |
test_select | 0.2192ms | 45.1976μs | 22.1251 KOps/s | 22.6826 KOps/s | |
test_select_nested | 0.1320ms | 59.8592μs | 16.7059 KOps/s | 16.8160 KOps/s | |
test_exclude_nested | 0.1569ms | 79.9248μs | 12.5118 KOps/s | 12.7405 KOps/s | |
test_empty[True] | 0.5662ms | 0.3847ms | 2.5992 KOps/s | 2.5830 KOps/s | |
test_empty[False] | 11.3235μs | 1.2342μs | 810.2670 KOps/s | 821.3709 KOps/s | |
test_unbind_speed | 0.3770ms | 0.2606ms | 3.8366 KOps/s | 3.7770 KOps/s | |
test_unbind_speed_stack0 | 0.6239ms | 0.2591ms | 3.8599 KOps/s | 3.8929 KOps/s | |
test_unbind_speed_stack1 | 95.2208ms | 0.8180ms | 1.2225 KOps/s | 1.4590 KOps/s | |
test_split | 0.1024s | 1.7751ms | 563.3535 Ops/s | 580.7394 Ops/s | |
test_chunk | 0.1078s | 1.7779ms | 562.4733 Ops/s | 575.8273 Ops/s | |
test_consolidate_njt[False-None] | 8.9386ms | 8.2526ms | 121.1745 Ops/s | 122.9759 Ops/s | |
test_creation[device0] | 0.2777ms | 92.0276μs | 10.8663 KOps/s | 10.8229 KOps/s | |
test_creation_from_tensor | 3.6470ms | 95.2526μs | 10.4984 KOps/s | 10.4926 KOps/s | |
test_add_one[memmap_tensor0] | 0.2235ms | 4.9249μs | 203.0487 KOps/s | 210.3187 KOps/s | |
test_contiguous[memmap_tensor0] | 13.4850μs | 0.5120μs | 1.9532 MOps/s | 1.8595 MOps/s | |
test_stack[memmap_tensor0] | 74.1390μs | 3.5333μs | 283.0202 KOps/s | 285.8714 KOps/s | |
test_memmaptd_index | 0.9957ms | 0.2428ms | 4.1185 KOps/s | 4.2108 KOps/s | |
test_memmaptd_index_astensor | 0.5913ms | 0.3225ms | 3.1005 KOps/s | 3.1670 KOps/s | |
test_memmaptd_index_op | 1.1363ms | 0.6105ms | 1.6379 KOps/s | 1.6636 KOps/s | |
test_serialize_model | 0.1219s | 0.1165s | 8.5865 Ops/s | 8.5185 Ops/s | |
test_serialize_model_pickle | 0.4635s | 0.3972s | 2.5177 Ops/s | 2.5773 Ops/s | |
test_serialize_weights | 0.1272s | 0.1163s | 8.6021 Ops/s | 8.7961 Ops/s | |
test_serialize_weights_returnearly | 0.1678s | 0.1597s | 6.2629 Ops/s | 6.4348 Ops/s | |
test_serialize_weights_pickle | 1.1998s | 0.7098s | 1.4089 Ops/s | 2.5452 Ops/s | |
test_serialize_weights_filesystem | 0.1545s | 0.1418s | 7.0528 Ops/s | 6.9516 Ops/s | |
test_serialize_model_filesystem | 0.2544s | 0.1575s | 6.3496 Ops/s | 5.9303 Ops/s | |
test_reshape_pytree | 0.1077ms | 26.5726μs | 37.6328 KOps/s | 35.7231 KOps/s | |
test_reshape_td | 65.1710μs | 32.7385μs | 30.5451 KOps/s | 29.7947 KOps/s | |
test_view_pytree | 64.0600μs | 26.7997μs | 37.3138 KOps/s | 37.1889 KOps/s | |
test_view_td | 81.5120μs | 37.5317μs | 26.6441 KOps/s | 25.5785 KOps/s | |
test_unbind_pytree | 65.5730μs | 29.7348μs | 33.6306 KOps/s | 33.1384 KOps/s | |
test_unbind_td | 0.3495ms | 38.4733μs | 25.9921 KOps/s | 25.5654 KOps/s | |
test_split_pytree | 92.9740μs | 29.5295μs | 33.8644 KOps/s | 33.7814 KOps/s | |
test_split_td | 0.2065ms | 44.5621μs | 22.4406 KOps/s | 22.4993 KOps/s | |
test_add_pytree | 85.7700μs | 35.8553μs | 27.8899 KOps/s | 27.8485 KOps/s | |
test_add_td | 0.1204ms | 58.9386μs | 16.9668 KOps/s | 17.4549 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1863ms | 63.5420μs | 15.7376 KOps/s | 16.0664 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4305ms | 0.1618ms | 6.1805 KOps/s | 6.1799 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1151ms | 45.9278μs | 21.7733 KOps/s | 21.7236 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2238ms | 0.1200ms | 8.3334 KOps/s | 8.3978 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 78.7670μs | 26.7138μs | 37.4339 KOps/s | 37.9560 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1178ms | 54.3315μs | 18.4055 KOps/s | 18.5156 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1596ms | 78.4112μs | 12.7533 KOps/s | 12.6968 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1294ms | 67.4611μs | 14.8233 KOps/s | 14.7467 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1843ms | 0.1050ms | 9.5224 KOps/s | 9.5247 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3552ms | 0.2005ms | 4.9876 KOps/s | 4.9941 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1257ms | 45.2383μs | 22.1052 KOps/s | 22.1112 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4961ms | 62.8109μs | 15.9208 KOps/s | 16.1116 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1713ms | 0.1029ms | 9.7206 KOps/s | 9.8270 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3281ms | 0.2028ms | 4.9317 KOps/s | 5.0014 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.2995ms | 0.2111ms | 4.7362 KOps/s | 4.7499 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2371ms | 0.1079ms | 9.2672 KOps/s | 9.5197 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2094ms | 56.6629μs | 17.6482 KOps/s | 18.3124 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2330ms | 46.5809μs | 21.4680 KOps/s | 21.6182 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 1.6912ms | 0.1593ms | 6.2793 KOps/s | 6.2768 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1844ms | 0.1032ms | 9.6921 KOps/s | 9.7437 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 62.1560μs | 21.6756μs | 46.1349 KOps/s | 46.4232 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1540ms | 60.1717μs | 16.6191 KOps/s | 16.9678 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1745ms | 80.8754μs | 12.3647 KOps/s | 12.4447 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1580ms | 69.2619μs | 14.4379 KOps/s | 14.6621 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2899ms | 0.2054ms | 4.8692 KOps/s | 4.8653 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.2965ms | 1.3213ms | 756.8493 Ops/s | 774.3006 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3476ms | 0.2025ms | 4.9371 KOps/s | 4.9037 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.4116ms | 0.7860ms | 1.2723 KOps/s | 1.2725 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8104ms | 0.4585ms | 2.1812 KOps/s | 2.2151 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.7010ms | 2.7891ms | 358.5372 Ops/s | 359.2957 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 87.5340μs | 35.7266μs | 27.9904 KOps/s | 27.6130 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5449ms | 34.5334μs | 28.9574 KOps/s | 30.9649 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 82.4340μs | 29.9334μs | 33.4075 KOps/s | 34.2046 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 61.6650μs | 23.7634μs | 42.0815 KOps/s | 43.6802 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 89.4770μs | 30.6937μs | 32.5800 KOps/s | 33.7693 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1165ms | 23.4789μs | 42.5915 KOps/s | 43.6007 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1249ms | 51.7991μs | 19.3054 KOps/s | 19.5317 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6168ms | 20.2795μs | 49.3110 KOps/s | 51.2073 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1227ms | 43.7861μs | 22.8383 KOps/s | 22.6450 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 92.0520μs | 18.7799μs | 53.2484 KOps/s | 53.4875 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1246ms | 45.0920μs | 22.1769 KOps/s | 22.0837 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 82.1440μs | 18.7534μs | 53.3236 KOps/s | 53.0449 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1452ms | 52.9564μs | 18.8835 KOps/s | 19.1258 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9503ms | 20.2777μs | 49.3152 KOps/s | 51.0906 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1383ms | 44.8435μs | 22.2998 KOps/s | 22.0719 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 70.3420μs | 18.7543μs | 53.3210 KOps/s | 52.8427 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1135ms | 44.9388μs | 22.2525 KOps/s | 22.1138 KOps/s | |
test_compile_indexing[int-pytree-eager] | 79.1880μs | 18.7520μs | 53.3276 KOps/s | 52.7335 KOps/s | |
test_mod_add[eager] | 0.1171ms | 34.9610μs | 28.6033 KOps/s | 27.4818 KOps/s | |
test_mod_add[compile] | 0.1049ms | 47.2363μs | 21.1702 KOps/s | 21.0926 KOps/s | |
test_mod_add[compile-overhead] | 0.1029ms | 47.0607μs | 21.2492 KOps/s | 21.0630 KOps/s | |
test_mod_wrap[eager] | 0.3543ms | 0.2219ms | 4.5061 KOps/s | 4.3319 KOps/s | |
test_mod_wrap[compile] | 0.3504ms | 0.2043ms | 4.8944 KOps/s | 4.8240 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3981ms | 0.2063ms | 4.8473 KOps/s | 4.8308 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.8880ms | 11.1583ms | 89.6192 Ops/s | 85.8502 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.4491ms | 11.0243ms | 90.7086 Ops/s | 89.7290 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.0655ms | 11.1437ms | 89.7365 Ops/s | 90.5023 Ops/s | |
test_seq_add[eager] | 0.2225ms | 0.1148ms | 8.7110 KOps/s | 8.1896 KOps/s | |
test_seq_add[compile] | 0.1381ms | 61.7229μs | 16.2014 KOps/s | 16.0074 KOps/s | |
test_seq_add[compile-overhead] | 0.1359ms | 60.0812μs | 16.6441 KOps/s | 16.5018 KOps/s | |
test_seq_wrap[eager] | 0.7163ms | 0.4377ms | 2.2848 KOps/s | 2.1806 KOps/s | |
test_seq_wrap[compile] | 0.4267ms | 0.2259ms | 4.4276 KOps/s | 4.2571 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4065ms | 0.2257ms | 4.4309 KOps/s | 4.3337 KOps/s | |
test_func_call_runtime[False-eager] | 0.9429ms | 0.5521ms | 1.8114 KOps/s | 1.7833 KOps/s | |
test_func_call_runtime[False-compile] | 0.5252ms | 0.4259ms | 2.3482 KOps/s | 2.3019 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8218ms | 0.4266ms | 2.3442 KOps/s | 2.3147 KOps/s | |
test_func_call_runtime[True-eager] | 0.8892ms | 0.7627ms | 1.3112 KOps/s | 1.2930 KOps/s | |
test_func_call_runtime[True-compile] | 0.5769ms | 0.4678ms | 2.1376 KOps/s | 2.1320 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5714ms | 0.4705ms | 2.1253 KOps/s | 2.1411 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.1115ms | 0.5555ms | 1.8003 KOps/s | 1.8240 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5322ms | 0.4242ms | 2.3572 KOps/s | 2.3452 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.8900ms | 0.4318ms | 2.3159 KOps/s | 2.3481 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0329ms | 0.9019ms | 1.1088 KOps/s | 1.1196 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7186ms | 0.4938ms | 2.0253 KOps/s | 2.0113 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8794ms | 0.4954ms | 2.0186 KOps/s | 2.0054 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4579ms | 1.8983ms | 526.7883 Ops/s | 518.1470 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.6540ms | 0.5191ms | 1.9264 KOps/s | 1.8981 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7202ms | 0.5247ms | 1.9058 KOps/s | 1.8866 KOps/s | |
test_distributed | 0.2403ms | 0.1255ms | 7.9709 KOps/s | 7.8263 KOps/s | |
test_tdmodule | 99.7860μs | 27.6518μs | 36.1641 KOps/s | 36.0669 KOps/s | |
test_tdmodule_dispatch | 79.6990μs | 48.5244μs | 20.6082 KOps/s | 19.4319 KOps/s | |
test_tdseq | 49.0820μs | 27.3153μs | 36.6095 KOps/s | 35.2808 KOps/s | |
test_tdseq_dispatch | 86.8930μs | 53.1970μs | 18.7981 KOps/s | 18.3843 KOps/s | |
test_instantiation_functorch | 2.0437ms | 1.5493ms | 645.4560 Ops/s | 648.3995 Ops/s | |
test_exec_functorch | 0.2659ms | 0.1811ms | 5.5213 KOps/s | 5.3988 KOps/s | |
test_exec_functional_call | 0.2722ms | 0.1723ms | 5.8036 KOps/s | 5.6431 KOps/s | |
test_exec_td_decorator | 0.5221ms | 0.2301ms | 4.3461 KOps/s | 4.2733 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1949ms | 0.6681ms | 1.4968 KOps/s | 1.4873 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0177ms | 0.6627ms | 1.5089 KOps/s | 1.4954 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8287ms | 0.5301ms | 1.8864 KOps/s | 1.8639 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7041ms | 0.5258ms | 1.9017 KOps/s | 1.8521 KOps/s | |
test_to_module_speed[True] | 1.4183ms | 1.2951ms | 772.1513 Ops/s | 774.9341 Ops/s | |
test_to_module_speed[False] | 1.3495ms | 1.2542ms | 797.3329 Ops/s | 787.9784 Ops/s | |
test_tc_init | 89.1370μs | 46.9950μs | 21.2789 KOps/s | 20.5525 KOps/s | |
test_tc_init_nested | 0.1549ms | 92.5622μs | 10.8035 KOps/s | 10.4041 KOps/s | |
test_tc_first_layer_tensor | 32.6520μs | 1.5551μs | 643.0276 KOps/s | 657.7241 KOps/s | |
test_tc_first_layer_nontensor | 44.5730μs | 4.8001μs | 208.3281 KOps/s | 207.6254 KOps/s | |
test_tc_second_layer_tensor | 24.6160μs | 2.8755μs | 347.7624 KOps/s | 348.5899 KOps/s | |
test_tc_second_layer_nontensor | 49.6430μs | 6.1211μs | 163.3685 KOps/s | 163.9168 KOps/s | |
test_unbind | 0.2109s | 13.9836ms | 71.5124 Ops/s | 81.4954 Ops/s | |
test_full_like | 8.1356ms | 7.2293ms | 138.3251 Ops/s | 76.9472 Ops/s | |
test_zeros_like | 3.2371ms | 2.7219ms | 367.3911 Ops/s | 133.0216 Ops/s | |
test_ones_like | 11.9239ms | 6.2486ms | 160.0349 Ops/s | 125.9021 Ops/s | |
test_clone | 12.5400ms | 8.0934ms | 123.5579 Ops/s | 103.1864 Ops/s | |
test_squeeze | 65.4120μs | 11.6145μs | 86.0995 KOps/s | 85.1183 KOps/s | |
test_unsqueeze | 0.2971ms | 89.6676μs | 11.1523 KOps/s | 11.0582 KOps/s | |
test_split | 0.4025ms | 0.1938ms | 5.1607 KOps/s | 5.2258 KOps/s | |
test_permute | 0.4298ms | 0.2211ms | 4.5222 KOps/s | 4.6250 KOps/s | |
test_stack | 33.0455ms | 26.0787ms | 38.3455 Ops/s | 39.7281 Ops/s | |
test_cat | 30.4396ms | 26.5291ms | 37.6944 Ops/s | 38.2788 Ops/s |
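
The Change column in these tables is empty, so any regression or speed-up has to be read off by comparing the Ops and Ops on Repo HEAD columns by hand. The following is a minimal sketch of that comparison, assuming that Ops is the throughput measured on this PR and Ops on Repo HEAD is the baseline. The helper names `parse_ops` and `relative_change` are hypothetical and are not part of tensordict or of the benchmark tooling.

```python
# Unit multipliers for the throughput cells as they appear in the tables.
_UNITS = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '54.7844 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * _UNITS[unit]


def relative_change(row: str) -> tuple[str, float]:
    """Return (test name, fractional change of Ops relative to Ops on Repo HEAD).

    Assumes the column order Name | Max | Mean | Ops | Ops on Repo HEAD | Change.
    """
    cells = [c.strip() for c in row.split("|")]
    name, ops_pr, ops_head = cells[0], cells[3], cells[4]
    pr, head = parse_ops(ops_pr), parse_ops(ops_head)
    return name, (pr - head) / head


# One row copied verbatim from the table above.
row = "test_zeros_like | 3.2371ms | 2.7219ms | 367.3911 Ops/s | 133.0216 Ops/s | |"
name, change = relative_change(row)
print(f"{name}: {change:+.1%}")
```

A positive value means the Ops column is higher than the Repo HEAD column. Single-run differences of a few percent in tables like these are usually within noise, so only large gaps are worth a closer look.
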

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.1910μs | 11.1467μs | 89.7130 KOps/s | 98.7330 KOps/s | |
test_plain_set_stack_nested | 50.4210μs | 11.2150μs | 89.1667 KOps/s | 99.0278 KOps/s | |
test_plain_set_nested_inplace | 55.4610μs | 11.9775μs | 83.4898 KOps/s | 90.5493 KOps/s | |
test_plain_set_stack_nested_inplace | 38.7200μs | 12.0119μs | 83.2507 KOps/s | 91.0921 KOps/s | |
test_items | 31.4410μs | 2.9051μs | 344.2199 KOps/s | 341.3686 KOps/s | |
test_items_nested | 0.4198ms | 0.3505ms | 2.8529 KOps/s | 2.8562 KOps/s | |
test_items_nested_locked | 0.4220ms | 0.3511ms | 2.8484 KOps/s | 2.8246 KOps/s | |
test_items_nested_leaf | 81.1810μs | 58.0229μs | 17.2346 KOps/s | 17.2810 KOps/s | |
test_items_stack_nested | 0.3934ms | 0.3515ms | 2.8448 KOps/s | 2.8356 KOps/s | |
test_items_stack_nested_leaf | 91.4410μs | 58.1611μs | 17.1936 KOps/s | 16.8205 KOps/s | |
test_items_stack_nested_locked | 0.3932ms | 0.3533ms | 2.8304 KOps/s | 2.8493 KOps/s | |
test_keys | 26.5610μs | 3.4505μs | 289.8089 KOps/s | 284.4356 KOps/s | |
test_keys_nested | 0.1206ms | 69.7502μs | 14.3369 KOps/s | 14.3233 KOps/s | |
test_keys_nested_locked | 0.7609ms | 75.2960μs | 13.2809 KOps/s | 13.0873 KOps/s | |
test_keys_nested_leaf | 96.7310μs | 61.1264μs | 16.3596 KOps/s | 16.2843 KOps/s | |
test_keys_stack_nested | 0.1015ms | 70.2266μs | 14.2396 KOps/s | 14.2895 KOps/s | |
test_keys_stack_nested_leaf | 91.3910μs | 61.6095μs | 16.2313 KOps/s | 16.0821 KOps/s | |
test_keys_stack_nested_locked | 0.1126ms | 75.2680μs | 13.2859 KOps/s | 13.1579 KOps/s | |
test_values | 5.3217μs | 0.8511μs | 1.1749 MOps/s | 1.1734 MOps/s | |
test_values_nested | 66.4500μs | 31.0616μs | 32.1941 KOps/s | 32.1424 KOps/s | |
test_values_nested_locked | 96.2910μs | 32.5385μs | 30.7328 KOps/s | 30.7212 KOps/s | |
test_values_nested_leaf | 86.7210μs | 33.4553μs | 29.8907 KOps/s | 29.8005 KOps/s | |
test_values_stack_nested | 62.6010μs | 31.3183μs | 31.9302 KOps/s | 31.5761 KOps/s | |
test_values_stack_nested_leaf | 68.9510μs | 33.5217μs | 29.8314 KOps/s | 29.3982 KOps/s | |
test_values_stack_nested_locked | 63.2700μs | 32.9127μs | 30.3834 KOps/s | 30.3179 KOps/s | |
test_membership | 3.2721μs | 0.5062μs | 1.9753 MOps/s | 1.9565 MOps/s | |
test_membership_nested | 24.1100μs | 2.0118μs | 497.0776 KOps/s | 509.2486 KOps/s | |
test_membership_nested_leaf | 15.6350μs | 2.0193μs | 495.2302 KOps/s | 498.1728 KOps/s | |
test_membership_stacked_nested | 44.0300μs | 2.0827μs | 480.1489 KOps/s | 490.9721 KOps/s | |
test_membership_stacked_nested_leaf | 53.5810μs | 2.0685μs | 483.4305 KOps/s | 489.5933 KOps/s | |
test_membership_nested_last | 34.3310μs | 2.9252μs | 341.8560 KOps/s | 344.5376 KOps/s | |
test_membership_nested_leaf_last | 27.5700μs | 2.9378μs | 340.3882 KOps/s | 341.7900 KOps/s | |
test_membership_stacked_nested_last | 26.5000μs | 2.9297μs | 341.3377 KOps/s | 281.2714 KOps/s | |
test_membership_stacked_nested_leaf_last | 39.3710μs | 2.9710μs | 336.5889 KOps/s | 279.9409 KOps/s | |
test_nested_getleaf | 34.9400μs | 6.1009μs | 163.9115 KOps/s | 163.7321 KOps/s | |
test_nested_get | 39.5800μs | 5.8402μs | 171.2281 KOps/s | 172.0528 KOps/s | |
test_stacked_getleaf | 39.4800μs | 6.1113μs | 163.6326 KOps/s | 164.2807 KOps/s | |
test_stacked_get | 41.7600μs | 5.8290μs | 171.5573 KOps/s | 171.2403 KOps/s | |
test_nested_getitemleaf | 35.1200μs | 6.2165μs | 160.8631 KOps/s | 161.3817 KOps/s | |
test_nested_getitem | 37.9900μs | 5.8564μs | 170.7532 KOps/s | 169.5891 KOps/s | |
test_stacked_getitemleaf | 40.5400μs | 6.1637μs | 162.2401 KOps/s | 160.9330 KOps/s | |
test_stacked_getitem | 36.1000μs | 5.8769μs | 170.1570 KOps/s | 170.2291 KOps/s | |
test_lock_nested | 0.9253ms | 0.3590ms | 2.7858 KOps/s | 2.6923 KOps/s | |
test_lock_stack_nested | 0.3953ms | 0.3345ms | 2.9894 KOps/s | 2.9782 KOps/s | |
test_unlock_nested | 0.6504ms | 0.3031ms | 3.2989 KOps/s | 3.2788 KOps/s | |
test_unlock_stack_nested | 0.3089ms | 0.2739ms | 3.6515 KOps/s | 3.6734 KOps/s | |
test_flatten_speed | 0.1047ms | 74.6648μs | 13.3932 KOps/s | 13.5151 KOps/s | |
test_unflatten_speed | 0.3611ms | 0.3054ms | 3.2742 KOps/s | 3.2775 KOps/s | |
test_common_ops | 1.5965ms | 0.6079ms | 1.6450 KOps/s | 1.7195 KOps/s | |
test_creation | 0.1074ms | 1.4642μs | 682.9852 KOps/s | 672.6815 KOps/s | |
test_creation_empty | 31.2200μs | 8.5971μs | 116.3188 KOps/s | 153.4251 KOps/s | |
test_creation_nested_1 | 32.3010μs | 10.2020μs | 98.0197 KOps/s | 124.5378 KOps/s | |
test_creation_nested_2 | 44.8700μs | 12.6432μs | 79.0939 KOps/s | 94.0992 KOps/s | |
test_clone | 0.1159ms | 10.3660μs | 96.4689 KOps/s | 93.6182 KOps/s | |
test_getitem[int] | 1.1478ms | 10.3665μs | 96.4645 KOps/s | 94.5748 KOps/s | |
test_getitem[slice_int] | 0.1258ms | 20.5254μs | 48.7201 KOps/s | 47.6808 KOps/s | |
test_getitem[range] | 0.1387ms | 37.6767μs | 26.5416 KOps/s | 26.3357 KOps/s | |
test_getitem[tuple] | 0.1068ms | 17.8308μs | 56.0826 KOps/s | 55.0087 KOps/s | |
test_getitem[list] | 0.1318ms | 32.4783μs | 30.7898 KOps/s | 30.4360 KOps/s | |
test_setitem_dim[int] | 92.5810μs | 18.6076μs | 53.7416 KOps/s | 54.6371 KOps/s | |
test_setitem_dim[slice_int] | 62.1700μs | 38.1717μs | 26.1974 KOps/s | 26.6299 KOps/s | |
test_setitem_dim[range] | 94.1210μs | 53.3894μs | 18.7303 KOps/s | 18.9544 KOps/s | |
test_setitem_dim[tuple] | 53.6110μs | 31.2072μs | 32.0439 KOps/s | 31.9594 KOps/s | |
test_setitem | 48.0600μs | 15.2914μs | 65.3961 KOps/s | 70.2949 KOps/s | |
test_set | 0.1244ms | 14.8938μs | 67.1421 KOps/s | 72.5388 KOps/s | |
test_set_shared | 1.8354ms | 0.1455ms | 6.8731 KOps/s | 6.7878 KOps/s | |
test_update | 0.5192ms | 17.9132μs | 55.8248 KOps/s | 62.2484 KOps/s | |
test_update_nested | 65.9600μs | 22.8441μs | 43.7749 KOps/s | 47.9582 KOps/s | |
test_update__nested | 0.4733ms | 24.5228μs | 40.7784 KOps/s | 40.3362 KOps/s | |
test_set_nested | 0.1290ms | 16.0782μs | 62.1961 KOps/s | 66.5039 KOps/s | |
test_set_nested_new | 0.1415ms | 18.2562μs | 54.7760 KOps/s | 57.5473 KOps/s | |
test_select | 0.2248ms | 30.3088μs | 32.9937 KOps/s | 34.3869 KOps/s | |
test_select_nested | 73.2610μs | 42.3668μs | 23.6034 KOps/s | 23.9156 KOps/s | |
test_exclude_nested | 0.1129ms | 62.1018μs | 16.1026 KOps/s | 16.4909 KOps/s | |
test_empty[True] | 0.3402ms | 0.2721ms | 3.6746 KOps/s | 3.6261 KOps/s | |
test_empty[False] | 3.8280μs | 0.7477μs | 1.3375 MOps/s | 1.3327 MOps/s | |
test_to | 86.6810μs | 54.9607μs | 18.1948 KOps/s | 17.7672 KOps/s | |
test_to_nonblocking | 76.4710μs | 45.9196μs | 21.7772 KOps/s | 21.5986 KOps/s | |
test_unbind_speed | 0.3121ms | 0.2244ms | 4.4569 KOps/s | 4.4885 KOps/s | |
test_unbind_speed_stack0 | 0.3085ms | 0.2286ms | 4.3743 KOps/s | 4.3448 KOps/s | |
test_unbind_speed_stack1 | 99.9329ms | 0.6493ms | 1.5402 KOps/s | 1.5605 KOps/s | |
test_split | 98.4185ms | 1.7174ms | 582.2831 Ops/s | 586.1055 Ops/s | |
test_chunk | 0.1010s | 1.5883ms | 629.5948 Ops/s | 699.1171 Ops/s | |
test_consolidate[False-None] | 2.7042ms | 2.6172ms | 382.0821 Ops/s | 338.4213 Ops/s | |
test_consolidate[default-None] | 2.2738ms | 1.6991ms | 588.5380 Ops/s | 580.3087 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8776ms | 1.7247ms | 579.8267 Ops/s | 565.8293 Ops/s | |
test_consolidate_njt[False-None] | 6.6927ms | 6.4878ms | 154.1365 Ops/s | 152.2057 Ops/s | |
test_to[False-False-None] | 1.7934ms | 1.6737ms | 597.4822 Ops/s | 593.5639 Ops/s | |
test_to[True-False-None] | 1.5830ms | 1.2829ms | 779.4767 Ops/s | 796.1403 Ops/s | |
test_to[within-False-None] | 4.3024ms | 4.0348ms | 247.8444 Ops/s | 250.0095 Ops/s | |
test_to[True-default-None] | 5.3168ms | 5.0127ms | 199.4932 Ops/s | 194.1921 Ops/s | |
test_to_njt[False-False-None] | 6.9762ms | 6.8557ms | 145.8647 Ops/s | 143.8748 Ops/s | |
test_to_njt[True-False-None] | 6.4178ms | 5.4077ms | 184.9206 Ops/s | 182.9370 Ops/s | |
test_to_njt[within-False-None] | 12.0288ms | 11.8752ms | 84.2094 Ops/s | 82.8409 Ops/s | |
test_creation[device0] | 0.4707ms | 78.2463μs | 12.7802 KOps/s | 12.3740 KOps/s | |
test_creation_from_tensor | 0.5274ms | 82.2823μs | 12.1533 KOps/s | 11.9020 KOps/s | |
test_add_one[memmap_tensor0] | 0.2233ms | 6.7328μs | 148.5260 KOps/s | 142.9803 KOps/s | |
test_contiguous[memmap_tensor0] | 2.0150μs | 0.4146μs | 2.4117 MOps/s | 2.2539 MOps/s | |
test_stack[memmap_tensor0] | 43.4910μs | 4.3282μs | 231.0435 KOps/s | 222.8131 KOps/s | |
test_memmaptd_index | 1.8960ms | 0.2426ms | 4.1223 KOps/s | 3.9922 KOps/s | |
test_memmaptd_index_astensor | 0.5773ms | 0.3000ms | 3.3337 KOps/s | 3.2170 KOps/s | |
test_memmaptd_index_op | 1.0666ms | 0.5816ms | 1.7195 KOps/s | 1.7631 KOps/s | |
test_serialize_model | 0.1309s | 0.1301s | 7.6874 Ops/s | 7.6704 Ops/s | |
test_serialize_model_pickle | 1.3822s | 1.2192s | 0.8202 Ops/s | 0.8199 Ops/s | |
test_serialize_weights | 0.1307s | 0.1300s | 7.6919 Ops/s | 7.7268 Ops/s | |
test_serialize_weights_returnearly | 0.5245s | 72.8446ms | 13.7279 Ops/s | 15.3146 Ops/s | |
test_serialize_weights_pickle | 1.3826s | 1.2200s | 0.8196 Ops/s | 0.8207 Ops/s | |
test_reshape_pytree | 54.0500μs | 22.1331μs | 45.1812 KOps/s | 43.7270 KOps/s | |
test_reshape_td | 61.1500μs | 26.0000μs | 38.4616 KOps/s | 36.7920 KOps/s | |
test_view_pytree | 46.5110μs | 22.1915μs | 45.0623 KOps/s | 44.9582 KOps/s | |
test_view_td | 64.4810μs | 28.9856μs | 34.4999 KOps/s | 34.0358 KOps/s | |
test_unbind_pytree | 0.1570ms | 27.5893μs | 36.2460 KOps/s | 35.2087 KOps/s | |
test_unbind_td | 0.7968ms | 34.4746μs | 29.0069 KOps/s | 28.5286 KOps/s | |
test_split_pytree | 61.3300μs | 29.3730μs | 34.0449 KOps/s | 32.8563 KOps/s | |
test_split_td | 1.0103ms | 37.5055μs | 26.6628 KOps/s | 25.8675 KOps/s | |
test_add_pytree | 75.3800μs | 34.0507μs | 29.3679 KOps/s | 29.1994 KOps/s | |
test_add_td | 71.5410μs | 44.2670μs | 22.5902 KOps/s | 23.2661 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1755ms | 0.1185ms | 8.4424 KOps/s | 8.0261 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2441ms | 0.1208ms | 8.2780 KOps/s | 7.8651 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1338ms | 93.5971μs | 10.6841 KOps/s | 10.4183 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2060ms | 0.1457ms | 6.8615 KOps/s | 6.6547 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 55.2900μs | 21.3776μs | 46.7780 KOps/s | 43.7365 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 55.1810μs | 26.6561μs | 37.5148 KOps/s | 37.0051 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1021ms | 63.8072μs | 15.6722 KOps/s | 15.2538 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1768ms | 49.0705μs | 20.3788 KOps/s | 19.8420 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2146ms | 0.1406ms | 7.1100 KOps/s | 6.8546 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3429ms | 0.2064ms | 4.8447 KOps/s | 4.8116 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1406ms | 96.1748μs | 10.3977 KOps/s | 10.2392 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1364ms | 51.4987μs | 19.4180 KOps/s | 19.0413 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1796ms | 0.1341ms | 7.4543 KOps/s | 7.2964 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6392ms | 0.4723ms | 2.1171 KOps/s | 2.0627 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3593ms | 0.2455ms | 4.0729 KOps/s | 4.0295 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1853ms | 0.1430ms | 6.9928 KOps/s | 7.0185 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1584ms | 62.6732μs | 15.9558 KOps/s | 15.9463 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1451ms | 99.2794μs | 10.0726 KOps/s | 10.1076 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5485ms | 0.4042ms | 2.4741 KOps/s | 2.4646 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2575ms | 0.1402ms | 7.1303 KOps/s | 7.3957 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 56.4710μs | 17.6231μs | 56.7436 KOps/s | 55.9402 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1039ms | 26.9964μs | 37.0420 KOps/s | 37.7281 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2035ms | 69.8711μs | 14.3121 KOps/s | 14.2676 KOps/s | |
test_compile_copy_flat[pytree-eager] | 88.1710μs | 51.3935μs | 19.4577 KOps/s | 19.4944 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6560ms | 0.3960ms | 2.5252 KOps/s | 2.0945 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8714ms | 2.5665ms | 389.6327 Ops/s | 376.3000 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6259ms | 0.4450ms | 2.2473 KOps/s | 2.2082 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8439ms | 2.5976ms | 384.9757 Ops/s | 383.6552 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5408ms | 0.1179ms | 8.4796 KOps/s | 8.6859 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5762ms | 79.2890μs | 12.6121 KOps/s | 12.5522 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.6012ms | 0.1087ms | 9.1962 KOps/s | 9.0866 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2162ms | 67.9074μs | 14.7259 KOps/s | 14.1059 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2611ms | 0.1139ms | 8.7833 KOps/s | 8.9285 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2222ms | 70.6783μs | 14.1486 KOps/s | 14.1032 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1460ms | 0.1004ms | 9.9642 KOps/s | 9.9176 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1402ms | 16.9702μs | 58.9269 KOps/s | 51.0413 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1492ms | 99.8473μs | 10.0153 KOps/s | 9.8681 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 51.4300μs | 17.2116μs | 58.1004 KOps/s | 60.8027 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1414ms | 0.1016ms | 9.8415 KOps/s | 9.7360 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1532ms | 16.7822μs | 59.5871 KOps/s | 61.7090 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2597ms | 0.1007ms | 9.9313 KOps/s | 9.3112 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5774ms | 16.8592μs | 59.3147 KOps/s | 57.0920 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1350ms | 96.2489μs | 10.3897 KOps/s | 10.2477 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 42.8700μs | 15.9928μs | 62.5283 KOps/s | 63.3775 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2453ms | 0.1010ms | 9.9014 KOps/s | 9.7325 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4979ms | 16.7455μs | 59.7176 KOps/s | 62.0440 KOps/s | |
test_mod_add[eager] | 0.1022ms | 37.8617μs | 26.4119 KOps/s | 27.3349 KOps/s | |
test_mod_add[compile] | 0.2677ms | 84.6622μs | 11.8116 KOps/s | 12.0766 KOps/s | |
test_mod_add[compile-overhead] | 0.3387ms | 0.1694ms | 5.9045 KOps/s | 5.3858 KOps/s | |
test_mod_wrap[eager] | 0.4238ms | 0.2638ms | 3.7909 KOps/s | 3.9754 KOps/s | |
test_mod_wrap[compile] | 0.4219ms | 0.2790ms | 3.5840 KOps/s | 3.2655 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1184ms | 3.8114ms | 262.3735 Ops/s | 263.0406 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6413ms | 1.4474ms | 690.9147 Ops/s | 725.4516 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4486ms | 1.3549ms | 738.0695 Ops/s | 791.7996 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.5987ms | 1.0447ms | 957.1963 Ops/s | 1.0635 KOps/s | |
test_seq_add[eager] | 0.2616ms | 0.1104ms | 9.0584 KOps/s | 8.8978 KOps/s | |
test_seq_add[compile] | 0.1361ms | 86.2874μs | 11.5892 KOps/s | 11.1929 KOps/s | |
test_seq_add[compile-overhead] | 0.3227ms | 0.1272ms | 7.8631 KOps/s | 7.5861 KOps/s | |
test_seq_wrap[eager] | 0.4783ms | 0.4139ms | 2.4161 KOps/s | 2.3568 KOps/s | |
test_seq_wrap[compile] | 0.4492ms | 0.2954ms | 3.3848 KOps/s | 3.2551 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2710ms | 0.2205ms | 4.5351 KOps/s | 4.4559 KOps/s | |
test_func_call_runtime[False-eager] | 0.8509ms | 0.7296ms | 1.3706 KOps/s | 1.3615 KOps/s | |
test_func_call_runtime[False-compile] | 0.9344ms | 0.7489ms | 1.3352 KOps/s | 1.3589 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4769ms | 0.3610ms | 2.7702 KOps/s | 2.7306 KOps/s | |
test_func_call_runtime[True-eager] | 1.0303ms | 0.8880ms | 1.1261 KOps/s | 1.1084 KOps/s | |
test_func_call_runtime[True-compile] | 0.9212ms | 0.7894ms | 1.2668 KOps/s | 1.3179 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4349ms | 0.3836ms | 2.6068 KOps/s | 2.5788 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8039ms | 0.7230ms | 1.3831 KOps/s | 1.3495 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8433ms | 0.7533ms | 1.3275 KOps/s | 1.3544 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4156ms | 0.3630ms | 2.7546 KOps/s | 2.7279 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1390ms | 0.9863ms | 1.0138 KOps/s | 994.3820 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8449ms | 0.7898ms | 1.2661 KOps/s | 1.2681 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6321ms | 0.4123ms | 2.4253 KOps/s | 2.4059 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5479ms | 2.0599ms | 485.4598 Ops/s | 478.7605 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9410ms | 0.8035ms | 1.2446 KOps/s | 1.2191 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4583ms | 0.4116ms | 2.4297 KOps/s | 2.3838 KOps/s | |
test_distributed | 2.7510ms | 0.2365ms | 4.2284 KOps/s | 8.7912 KOps/s | |
test_tdmodule | 32.0010μs | 19.0046μs | 52.6187 KOps/s | 56.5904 KOps/s | |
test_tdmodule_dispatch | 83.9110μs | 34.3159μs | 29.1410 KOps/s | 31.2856 KOps/s | |
test_tdseq | 40.0900μs | 19.0248μs | 52.5629 KOps/s | 54.9556 KOps/s | |
test_tdseq_dispatch | 64.7510μs | 37.0104μs | 27.0194 KOps/s | 28.8575 KOps/s | |
test_instantiation_functorch | 1.6932ms | 1.5634ms | 639.6281 Ops/s | 636.7230 Ops/s | |
test_exec_functorch | 0.1983ms | 0.1447ms | 6.9085 KOps/s | 6.9167 KOps/s | |
test_exec_functional_call | 0.2057ms | 0.1400ms | 7.1448 KOps/s | 7.1737 KOps/s | |
test_exec_td_decorator | 0.3742ms | 0.1871ms | 5.3447 KOps/s | 5.3490 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8963ms | 0.6882ms | 1.4531 KOps/s | 1.4539 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9235ms | 0.6861ms | 1.4575 KOps/s | 1.4504 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7216ms | 0.5962ms | 1.6772 KOps/s | 1.6484 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8156ms | 0.5976ms | 1.6734 KOps/s | 1.6786 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.3638ms | 19.4625ms | 51.3808 Ops/s | 52.3957 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.5579ms | 19.5417ms | 51.1726 Ops/s | 52.3197 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.8694ms | 19.3031ms | 51.8050 Ops/s | 52.8125 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.9713ms | 19.2663ms | 51.9040 Ops/s | 52.6550 Ops/s | |
test_to_module_speed[True] | 1.2743ms | 0.9352ms | 1.0693 KOps/s | 1.0616 KOps/s | |
test_to_module_speed[False] | 1.4133ms | 0.9190ms | 1.0881 KOps/s | 1.0777 KOps/s | |
test_tc_init | 64.7010μs | 35.0830μs | 28.5038 KOps/s | 30.0350 KOps/s | |
test_tc_init_nested | 0.1061ms | 71.2113μs | 14.0427 KOps/s | 14.5636 KOps/s | |
test_tc_first_layer_tensor | 4.7987μs | 0.7013μs | 1.4260 MOps/s | 1.4024 MOps/s | |
test_tc_first_layer_nontensor | 22.9010μs | 2.3288μs | 429.3968 KOps/s | 427.7521 KOps/s | |
test_tc_second_layer_tensor | 22.5778μs | 1.4128μs | 707.8046 KOps/s | 702.4825 KOps/s | |
test_tc_second_layer_nontensor | 0.2796ms | 3.0501μs | 327.8617 KOps/s | 323.6898 KOps/s | |
test_unbind | 0.2301s | 10.0434ms | 99.5677 Ops/s | 151.2010 Ops/s | |
test_full_like | 13.2578ms | 9.7277ms | 102.7996 Ops/s | 101.3283 Ops/s | |
test_zeros_like | 9.4929ms | 7.2851ms | 137.2672 Ops/s | 227.4447 Ops/s | |
test_ones_like | 5.3032ms | 4.4257ms | 225.9514 Ops/s | 231.6334 Ops/s | |
test_clone | 7.7813ms | 6.8354ms | 146.2981 Ops/s | 145.3506 Ops/s | |
test_squeeze | 0.1541ms | 9.0631μs | 110.3377 KOps/s | 108.7411 KOps/s | |
test_unsqueeze | 0.1136ms | 66.5784μs | 15.0199 KOps/s | 14.7476 KOps/s | |
test_split | 0.3972ms | 0.1512ms | 6.6146 KOps/s | 6.4233 KOps/s | |
test_permute | 0.2760ms | 0.1710ms | 5.8493 KOps/s | 5.5920 KOps/s | |
test_stack | 54.2521ms | 52.5314ms | 19.0362 Ops/s | 19.3588 Ops/s | |
test_cat | 52.2857ms | 51.4988ms | 19.4179 Ops/s | 19.3274 Ops/s | |
vmoens added a commit that referenced this pull request on Dec 2, 2024
ghstack-source-id: be142a150bf4378a0806347257c3cf64c78e4eda Pull Request resolved: #1120
Labels: CLA Signed, enhancement (New feature or request)
Stack from ghstack (oldest at bottom):