-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Versioning] v0.6.1 #1072
Merged
Merged
[Versioning] v0.6.1 #1072
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 4, 2024
ghstack-source-id: a899c95c12a3b1b986ed429b6507711c4126189e Pull Request resolved: #1072
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 4, 2024
vmoens
added a commit
that referenced
this pull request
Nov 4, 2024
ghstack-source-id: a899c95c12a3b1b986ed429b6507711c4126189e Pull Request resolved: #1072
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 40.6250μs | 17.1584μs | 58.2806 KOps/s | 52.7423 KOps/s | |
test_plain_set_stack_nested | 65.5120μs | 17.3276μs | 57.7114 KOps/s | 52.4996 KOps/s | |
test_plain_set_nested_inplace | 48.1890μs | 18.8747μs | 52.9811 KOps/s | 48.8148 KOps/s | |
test_plain_set_stack_nested_inplace | 78.9160μs | 19.0455μs | 52.5057 KOps/s | 48.8696 KOps/s | |
test_items | 23.6240μs | 4.2024μs | 237.9592 KOps/s | 233.3797 KOps/s | |
test_items_nested | 0.7047ms | 0.3432ms | 2.9137 KOps/s | 2.9177 KOps/s | |
test_items_nested_locked | 0.6391ms | 0.3454ms | 2.8956 KOps/s | 2.9501 KOps/s | |
test_items_nested_leaf | 0.1392ms | 72.3845μs | 13.8151 KOps/s | 13.6558 KOps/s | |
test_items_stack_nested | 0.7142ms | 0.3474ms | 2.8789 KOps/s | 2.9224 KOps/s | |
test_items_stack_nested_leaf | 0.1351ms | 76.4830μs | 13.0748 KOps/s | 13.2787 KOps/s | |
test_items_stack_nested_locked | 0.6404ms | 0.3491ms | 2.8644 KOps/s | 2.8989 KOps/s | |
test_keys | 20.8090μs | 3.5548μs | 281.3118 KOps/s | 277.5179 KOps/s | |
test_keys_nested | 0.2958ms | 0.1361ms | 7.3454 KOps/s | 7.1907 KOps/s | |
test_keys_nested_locked | 2.1816ms | 0.1414ms | 7.0706 KOps/s | 7.0014 KOps/s | |
test_keys_nested_leaf | 0.1982ms | 0.1171ms | 8.5379 KOps/s | 8.6437 KOps/s | |
test_keys_stack_nested | 0.2322ms | 0.1348ms | 7.4195 KOps/s | 7.3370 KOps/s | |
test_keys_stack_nested_leaf | 0.1925ms | 0.1157ms | 8.6451 KOps/s | 8.6175 KOps/s | |
test_keys_stack_nested_locked | 0.2383ms | 0.1412ms | 7.0803 KOps/s | 7.0554 KOps/s | |
test_values | 9.6624μs | 1.0337μs | 967.4235 KOps/s | 964.4775 KOps/s | |
test_values_nested | 0.1324ms | 55.8405μs | 17.9082 KOps/s | 17.9561 KOps/s | |
test_values_nested_locked | 0.1077ms | 55.6803μs | 17.9597 KOps/s | 18.1171 KOps/s | |
test_values_nested_leaf | 0.1206ms | 60.7518μs | 16.4604 KOps/s | 15.9033 KOps/s | |
test_values_stack_nested | 0.1166ms | 57.3568μs | 17.4347 KOps/s | 16.2986 KOps/s | |
test_values_stack_nested_leaf | 0.1178ms | 60.1625μs | 16.6216 KOps/s | 16.6550 KOps/s | |
test_values_stack_nested_locked | 0.1599ms | 57.3552μs | 17.4352 KOps/s | 17.4837 KOps/s | |
test_membership | 13.7960μs | 0.9155μs | 1.0923 MOps/s | 1.1161 MOps/s | |
test_membership_nested | 54.3720μs | 2.8734μs | 348.0249 KOps/s | 365.8537 KOps/s | |
test_membership_nested_leaf | 67.8160μs | 2.8163μs | 355.0761 KOps/s | 363.0575 KOps/s | |
test_membership_stacked_nested | 47.6190μs | 2.7971μs | 357.5142 KOps/s | 367.6419 KOps/s | |
test_membership_stacked_nested_leaf | 17.9730μs | 2.8167μs | 355.0203 KOps/s | 364.3280 KOps/s | |
test_membership_nested_last | 79.2980μs | 4.2254μs | 236.6658 KOps/s | 242.2528 KOps/s | |
test_membership_nested_leaf_last | 50.7950μs | 4.1805μs | 239.2070 KOps/s | 240.8631 KOps/s | |
test_membership_stacked_nested_last | 23.0330μs | 4.1402μs | 241.5324 KOps/s | 161.8911 KOps/s | |
test_membership_stacked_nested_leaf_last | 26.9400μs | 4.0979μs | 244.0268 KOps/s | 163.2636 KOps/s | |
test_nested_getleaf | 41.4870μs | 10.9034μs | 91.7142 KOps/s | 94.6388 KOps/s | |
test_nested_get | 54.8020μs | 10.3378μs | 96.7327 KOps/s | 100.1605 KOps/s | |
test_stacked_getleaf | 53.8300μs | 10.8910μs | 91.8190 KOps/s | 94.5992 KOps/s | |
test_stacked_get | 95.7980μs | 10.3595μs | 96.5293 KOps/s | 100.4727 KOps/s | |
test_nested_getitemleaf | 64.7920μs | 11.7518μs | 85.0932 KOps/s | 90.5349 KOps/s | |
test_nested_getitem | 53.0490μs | 10.4579μs | 95.6215 KOps/s | 96.6036 KOps/s | |
test_stacked_getitemleaf | 30.8580μs | 11.1400μs | 89.7668 KOps/s | 91.2541 KOps/s | |
test_stacked_getitem | 64.3390μs | 10.5242μs | 95.0188 KOps/s | 97.6026 KOps/s | |
test_lock_nested | 4.5521ms | 0.4569ms | 2.1888 KOps/s | 2.1909 KOps/s | |
test_lock_stack_nested | 0.6290ms | 0.4118ms | 2.4285 KOps/s | 2.4140 KOps/s | |
test_unlock_nested | 0.9928ms | 0.3764ms | 2.6569 KOps/s | 2.6577 KOps/s | |
test_unlock_stack_nested | 0.5151ms | 0.3324ms | 3.0082 KOps/s | 2.9950 KOps/s | |
test_flatten_speed | 0.1661ms | 93.0623μs | 10.7455 KOps/s | 10.6870 KOps/s | |
test_unflatten_speed | 0.7848ms | 0.4652ms | 2.1497 KOps/s | 2.1563 KOps/s | |
test_common_ops | 4.1601ms | 0.7780ms | 1.2853 KOps/s | 1.1893 KOps/s | |
test_creation | 40.1520μs | 2.1016μs | 475.8272 KOps/s | 465.9196 KOps/s | |
test_creation_empty | 45.6350μs | 10.1683μs | 98.3449 KOps/s | 73.8043 KOps/s | |
test_creation_nested_1 | 50.0330μs | 12.7488μs | 78.4388 KOps/s | 60.9607 KOps/s | |
test_creation_nested_2 | 54.1710μs | 17.0649μs | 58.5997 KOps/s | 48.0597 KOps/s | |
test_clone | 0.1666ms | 13.2899μs | 75.2453 KOps/s | 74.3197 KOps/s | |
test_getitem[int] | 0.9796ms | 12.7452μs | 78.4612 KOps/s | 77.9628 KOps/s | |
test_getitem[slice_int] | 0.1634ms | 25.1628μs | 39.7412 KOps/s | 40.1837 KOps/s | |
test_getitem[range] | 0.3265ms | 50.1391μs | 19.9445 KOps/s | 20.4085 KOps/s | |
test_getitem[tuple] | 0.1618ms | 20.2745μs | 49.3229 KOps/s | 48.9412 KOps/s | |
test_getitem[list] | 0.4190ms | 46.4827μs | 21.5134 KOps/s | 22.2251 KOps/s | |
test_setitem_dim[int] | 54.8620μs | 26.0368μs | 38.4071 KOps/s | 39.1376 KOps/s | |
test_setitem_dim[slice_int] | 0.1376ms | 54.6514μs | 18.2978 KOps/s | 19.4438 KOps/s | |
test_setitem_dim[range] | 0.1287ms | 76.8670μs | 13.0095 KOps/s | 13.3370 KOps/s | |
test_setitem_dim[tuple] | 93.8050μs | 41.8070μs | 23.9194 KOps/s | 24.5067 KOps/s | |
test_setitem | 0.1742ms | 19.9506μs | 50.1238 KOps/s | 46.1868 KOps/s | |
test_set | 0.1693ms | 19.1202μs | 52.3008 KOps/s | 47.6706 KOps/s | |
test_set_shared | 2.4304ms | 0.1779ms | 5.6216 KOps/s | 5.7713 KOps/s | |
test_update | 0.1907ms | 21.6371μs | 46.2170 KOps/s | 40.1972 KOps/s | |
test_update_nested | 0.2037ms | 32.0851μs | 31.1671 KOps/s | 28.8216 KOps/s | |
test_update__nested | 1.0777ms | 33.2096μs | 30.1118 KOps/s | 30.9739 KOps/s | |
test_set_nested | 0.1950ms | 21.7496μs | 45.9779 KOps/s | 43.5638 KOps/s | |
test_set_nested_new | 0.2045ms | 26.3838μs | 37.9021 KOps/s | 36.2456 KOps/s | |
test_select | 0.2191ms | 42.1569μs | 23.7209 KOps/s | 22.3241 KOps/s | |
test_select_nested | 0.1321ms | 60.2389μs | 16.6006 KOps/s | 16.7526 KOps/s | |
test_exclude_nested | 0.3692ms | 75.0010μs | 13.3331 KOps/s | 13.4738 KOps/s | |
test_empty[True] | 0.6963ms | 0.3487ms | 2.8680 KOps/s | 2.8529 KOps/s | |
test_empty[False] | 12.1025μs | 1.2943μs | 772.6311 KOps/s | 828.5754 KOps/s | |
test_unbind_speed | 0.5738ms | 0.2679ms | 3.7326 KOps/s | 3.8242 KOps/s | |
test_unbind_speed_stack0 | 0.4032ms | 0.2575ms | 3.8840 KOps/s | 3.8936 KOps/s | |
test_unbind_speed_stack1 | 0.1156s | 0.7802ms | 1.2816 KOps/s | 1.4130 KOps/s | |
test_split | 0.1111s | 1.7632ms | 567.1527 Ops/s | 552.4067 Ops/s | |
test_chunk | 1.7568ms | 1.6016ms | 624.3837 Ops/s | 552.8162 Ops/s | |
test_consolidate_njt[False-None] | 0.1243s | 9.2964ms | 107.5680 Ops/s | 114.8861 Ops/s | |
test_creation[device0] | 0.2862ms | 92.6147μs | 10.7974 KOps/s | 10.7795 KOps/s | |
test_creation_from_tensor | 4.8996ms | 96.6267μs | 10.3491 KOps/s | 10.2410 KOps/s | |
test_add_one[memmap_tensor0] | 0.2568ms | 4.9067μs | 203.8023 KOps/s | 207.0933 KOps/s | |
test_contiguous[memmap_tensor0] | 12.8040μs | 0.5011μs | 1.9955 MOps/s | 1.9848 MOps/s | |
test_stack[memmap_tensor0] | 73.9080μs | 3.4097μs | 293.2849 KOps/s | 288.1371 KOps/s | |
test_memmaptd_index | 1.2916ms | 0.2445ms | 4.0900 KOps/s | 4.1464 KOps/s | |
test_memmaptd_index_astensor | 0.6889ms | 0.3225ms | 3.1011 KOps/s | 3.1202 KOps/s | |
test_memmaptd_index_op | 1.1671ms | 0.5781ms | 1.7299 KOps/s | 1.5815 KOps/s | |
test_serialize_model | 0.1244s | 0.1201s | 8.3248 Ops/s | 8.4042 Ops/s | |
test_serialize_model_pickle | 0.4460s | 0.3986s | 2.5089 Ops/s | 2.4888 Ops/s | |
test_serialize_weights | 0.2371s | 0.1348s | 7.4207 Ops/s | 7.2356 Ops/s | |
test_serialize_weights_returnearly | 0.3144s | 0.1799s | 5.5579 Ops/s | 6.2258 Ops/s | |
test_serialize_weights_pickle | 0.4568s | 0.4048s | 2.4705 Ops/s | 2.4051 Ops/s | |
test_serialize_weights_filesystem | 0.1497s | 0.1467s | 6.8185 Ops/s | 6.6930 Ops/s | |
test_serialize_model_filesystem | 0.1624s | 0.1533s | 6.5236 Ops/s | 5.6051 Ops/s | |
test_reshape_pytree | 67.4750μs | 26.5126μs | 37.7179 KOps/s | 36.8825 KOps/s | |
test_reshape_td | 89.0060μs | 32.2845μs | 30.9746 KOps/s | 30.6930 KOps/s | |
test_view_pytree | 69.0090μs | 26.4607μs | 37.7919 KOps/s | 36.8940 KOps/s | |
test_view_td | 0.1091ms | 37.4728μs | 26.6860 KOps/s | 26.8158 KOps/s | |
test_unbind_pytree | 75.8510μs | 29.9130μs | 33.4302 KOps/s | 33.1698 KOps/s | |
test_unbind_td | 0.3455ms | 38.5605μs | 25.9333 KOps/s | 26.0113 KOps/s | |
test_split_pytree | 0.1061ms | 29.7535μs | 33.6095 KOps/s | 33.8923 KOps/s | |
test_split_td | 0.1216s | 56.9765μs | 17.5511 KOps/s | 22.5205 KOps/s | |
test_add_pytree | 0.1100ms | 36.4329μs | 27.4477 KOps/s | 27.3969 KOps/s | |
test_add_td | 0.1704ms | 52.8631μs | 18.9168 KOps/s | 15.9509 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1501ms | 68.3839μs | 14.6233 KOps/s | 15.5390 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4191ms | 0.1634ms | 6.1212 KOps/s | 6.1853 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1307ms | 47.5273μs | 21.0406 KOps/s | 21.0838 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2315ms | 0.1188ms | 8.4183 KOps/s | 8.3454 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 88.1330μs | 26.6512μs | 37.5217 KOps/s | 37.7653 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1220ms | 54.0378μs | 18.5056 KOps/s | 18.4063 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1486ms | 78.8938μs | 12.6753 KOps/s | 12.5503 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1553ms | 68.4044μs | 14.6190 KOps/s | 14.5761 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1949ms | 0.1079ms | 9.2703 KOps/s | 9.3163 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4169ms | 0.2017ms | 4.9580 KOps/s | 5.0601 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1304ms | 47.4954μs | 21.0547 KOps/s | 21.8301 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5040ms | 63.4874μs | 15.7512 KOps/s | 16.1348 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2084ms | 0.1061ms | 9.4276 KOps/s | 9.5870 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3590ms | 0.1981ms | 5.0482 KOps/s | 4.8217 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4241ms | 0.2134ms | 4.6871 KOps/s | 4.7379 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2283ms | 0.1090ms | 9.1771 KOps/s | 9.3286 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2071ms | 56.0511μs | 17.8409 KOps/s | 18.3696 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1221ms | 49.3955μs | 20.2448 KOps/s | 21.5297 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3196ms | 0.1612ms | 6.2044 KOps/s | 6.1515 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 6.2805ms | 0.1083ms | 9.2355 KOps/s | 9.6845 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 72.4950μs | 21.3822μs | 46.7679 KOps/s | 46.8468 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1245ms | 58.0931μs | 17.2137 KOps/s | 16.4221 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1601ms | 81.3600μs | 12.2910 KOps/s | 11.8786 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1779ms | 68.5043μs | 14.5976 KOps/s | 14.1987 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3435ms | 0.2123ms | 4.7093 KOps/s | 4.7466 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.6355ms | 1.3037ms | 767.0606 Ops/s | 755.3999 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3171ms | 0.2094ms | 4.7763 KOps/s | 4.8708 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.2232ms | 0.7847ms | 1.2743 KOps/s | 1.2552 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.7043ms | 0.4770ms | 2.0964 KOps/s | 2.1765 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.3333ms | 2.6155ms | 382.3375 Ops/s | 253.0705 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 99.4250μs | 37.8404μs | 26.4268 KOps/s | 26.5267 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6294ms | 32.1587μs | 31.0958 KOps/s | 29.0060 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1079ms | 30.3884μs | 32.9073 KOps/s | 32.4289 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1843ms | 23.5482μs | 42.4661 KOps/s | 43.2588 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 76.6920μs | 30.9327μs | 32.3283 KOps/s | 31.5490 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 73.8280μs | 23.0981μs | 43.2936 KOps/s | 42.5197 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2458ms | 54.0521μs | 18.5007 KOps/s | 18.7416 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4639ms | 19.4130μs | 51.5119 KOps/s | 47.9488 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1188ms | 46.0924μs | 21.6955 KOps/s | 21.8231 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 74.8690μs | 18.4250μs | 54.2742 KOps/s | 51.8297 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1260ms | 46.5914μs | 21.4632 KOps/s | 21.3253 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 60.6230μs | 18.5246μs | 53.9823 KOps/s | 52.6876 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2626ms | 55.6508μs | 17.9692 KOps/s | 18.3535 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8659ms | 19.4148μs | 51.5070 KOps/s | 50.2521 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1936ms | 46.6390μs | 21.4413 KOps/s | 21.5567 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 62.5160μs | 18.6534μs | 53.6096 KOps/s | 53.2360 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1341ms | 46.7815μs | 21.3760 KOps/s | 21.5175 KOps/s | |
test_compile_indexing[int-pytree-eager] | 82.6140μs | 18.5584μs | 53.8841 KOps/s | 53.3345 KOps/s | |
test_mod_add[eager] | 0.1031ms | 25.7915μs | 38.7725 KOps/s | 35.4727 KOps/s | |
test_mod_add[compile] | 0.1921ms | 47.5040μs | 21.0508 KOps/s | 21.6099 KOps/s | |
test_mod_add[compile-overhead] | 0.1156ms | 47.9819μs | 20.8412 KOps/s | 21.4610 KOps/s | |
test_mod_wrap[eager] | 0.3990ms | 0.2263ms | 4.4181 KOps/s | 4.3700 KOps/s | |
test_mod_wrap[compile] | 2.0455ms | 0.2106ms | 4.7476 KOps/s | 4.7182 KOps/s | |
test_mod_wrap[compile-overhead] | 2.3272ms | 0.2096ms | 4.7710 KOps/s | 4.7823 KOps/s | |
test_mod_wrap_and_backward[eager] | 21.8281ms | 12.5535ms | 79.6588 Ops/s | 82.6943 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.9114ms | 13.2396ms | 75.5308 Ops/s | 77.5451 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 17.0945ms | 13.5140ms | 73.9976 Ops/s | 73.7779 Ops/s | |
test_seq_add[eager] | 0.1843ms | 91.7857μs | 10.8949 KOps/s | 10.1461 KOps/s | |
test_seq_add[compile] | 0.1291ms | 64.9900μs | 15.3870 KOps/s | 16.0402 KOps/s | |
test_seq_add[compile-overhead] | 0.1448ms | 62.2994μs | 16.0515 KOps/s | 16.7222 KOps/s | |
test_seq_wrap[eager] | 0.6058ms | 0.4029ms | 2.4819 KOps/s | 2.3884 KOps/s | |
test_seq_wrap[compile] | 0.4566ms | 0.2357ms | 4.2436 KOps/s | 4.2388 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4703ms | 0.2365ms | 4.2276 KOps/s | 4.2927 KOps/s | |
test_func_call_runtime[False-eager] | 1.1101ms | 0.5941ms | 1.6834 KOps/s | 1.7177 KOps/s | |
test_func_call_runtime[False-compile] | 0.7037ms | 0.4371ms | 2.2876 KOps/s | 2.2660 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8793ms | 0.4398ms | 2.2738 KOps/s | 2.3026 KOps/s | |
test_func_call_runtime[True-eager] | 1.1932ms | 0.8033ms | 1.2449 KOps/s | 1.2610 KOps/s | |
test_func_call_runtime[True-compile] | 0.6779ms | 0.4788ms | 2.0884 KOps/s | 2.1045 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6820ms | 0.4800ms | 2.0832 KOps/s | 2.1143 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0650ms | 0.5815ms | 1.7196 KOps/s | 1.7456 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8856ms | 0.4355ms | 2.2960 KOps/s | 2.3124 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5889ms | 0.4376ms | 2.2854 KOps/s | 2.2878 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4969ms | 0.9417ms | 1.0619 KOps/s | 1.0790 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.7875ms | 0.5084ms | 1.9668 KOps/s | 1.9948 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.7033ms | 0.5032ms | 1.9874 KOps/s | 1.9979 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6863ms | 1.9845ms | 503.9135 Ops/s | 505.2186 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0814ms | 0.5390ms | 1.8551 KOps/s | 1.8856 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7173ms | 0.5360ms | 1.8655 KOps/s | 1.8482 KOps/s | |
test_distributed | 0.4918ms | 0.1268ms | 7.8890 KOps/s | 7.6506 KOps/s | |
test_tdmodule | 72.1840μs | 20.0631μs | 49.8427 KOps/s | 47.6101 KOps/s | |
test_tdmodule_dispatch | 0.1061ms | 35.4651μs | 28.1967 KOps/s | 24.1775 KOps/s | |
test_tdseq | 41.6580μs | 21.2308μs | 47.1013 KOps/s | 41.5570 KOps/s | |
test_tdseq_dispatch | 86.1810μs | 42.6418μs | 23.4512 KOps/s | 21.2705 KOps/s | |
test_instantiation_functorch | 1.9008ms | 1.5620ms | 640.2108 Ops/s | 639.6578 Ops/s | |
test_exec_functorch | 0.3375ms | 0.1861ms | 5.3739 KOps/s | 5.4799 KOps/s | |
test_exec_functional_call | 0.4538ms | 0.1844ms | 5.4240 KOps/s | 5.8038 KOps/s | |
test_exec_td_decorator | 0.6317ms | 0.2402ms | 4.1635 KOps/s | 4.4160 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1257ms | 0.6594ms | 1.5166 KOps/s | 1.5479 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9760ms | 0.6598ms | 1.5155 KOps/s | 1.5447 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7972ms | 0.5392ms | 1.8546 KOps/s | 1.9011 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7777ms | 0.5368ms | 1.8628 KOps/s | 1.9011 KOps/s | |
test_to_module_speed[True] | 2.1121ms | 1.3104ms | 763.1264 Ops/s | 761.2349 Ops/s | |
test_to_module_speed[False] | 2.0077ms | 1.2643ms | 790.9234 Ops/s | 793.2066 Ops/s | |
test_tc_init | 0.1241ms | 43.0278μs | 23.2408 KOps/s | 20.4240 KOps/s | |
test_tc_init_nested | 0.1394ms | 85.2975μs | 11.7237 KOps/s | 10.0357 KOps/s | |
test_tc_first_layer_tensor | 18.2340μs | 1.5240μs | 656.1827 KOps/s | 658.4102 KOps/s | |
test_tc_first_layer_nontensor | 33.5120μs | 4.7058μs | 212.5046 KOps/s | 206.1823 KOps/s | |
test_tc_second_layer_tensor | 28.1020μs | 2.8122μs | 355.5945 KOps/s | 361.3598 KOps/s | |
test_tc_second_layer_nontensor | 79.3390μs | 5.9992μs | 166.6896 KOps/s | 162.8665 KOps/s | |
test_unbind | 0.2523s | 14.3922ms | 69.4820 Ops/s | 65.9153 Ops/s | |
test_full_like | 11.3466ms | 9.1762ms | 108.9776 Ops/s | 103.3152 Ops/s | |
test_zeros_like | 5.6351ms | 3.5556ms | 281.2472 Ops/s | 280.7879 Ops/s | |
test_ones_like | 4.9040ms | 4.0107ms | 249.3299 Ops/s | 243.1663 Ops/s | |
test_clone | 9.0144ms | 6.3598ms | 157.2378 Ops/s | 155.2748 Ops/s | |
test_squeeze | 69.5900μs | 11.7043μs | 85.4390 KOps/s | 84.7411 KOps/s | |
test_unsqueeze | 0.1641ms | 88.6437μs | 11.2811 KOps/s | 11.4003 KOps/s | |
test_split | 0.5320ms | 0.1929ms | 5.1845 KOps/s | 5.0797 KOps/s | |
test_permute | 0.4649ms | 0.2299ms | 4.3498 KOps/s | 4.4731 KOps/s | |
test_stack | 31.4410ms | 27.7327ms | 36.0585 Ops/s | 35.7753 Ops/s | |
test_cat | 29.6466ms | 27.3724ms | 36.5331 Ops/s | 35.7598 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.5210μs | 11.4003μs | 87.7172 KOps/s | 88.8863 KOps/s | |
test_plain_set_stack_nested | 35.2510μs | 11.5148μs | 86.8444 KOps/s | 88.7211 KOps/s | |
test_plain_set_nested_inplace | 63.7910μs | 12.2496μs | 81.6350 KOps/s | 82.5517 KOps/s | |
test_plain_set_stack_nested_inplace | 0.3978ms | 12.3076μs | 81.2507 KOps/s | 82.4431 KOps/s | |
test_items | 25.7900μs | 2.8640μs | 349.1650 KOps/s | 338.9081 KOps/s | |
test_items_nested | 0.6874ms | 0.3206ms | 3.1194 KOps/s | 3.1229 KOps/s | |
test_items_nested_locked | 0.6959ms | 0.3246ms | 3.0811 KOps/s | 3.0871 KOps/s | |
test_items_nested_leaf | 0.4341ms | 58.5068μs | 17.0920 KOps/s | 17.1363 KOps/s | |
test_items_stack_nested | 0.7034ms | 0.3217ms | 3.1085 KOps/s | 3.1356 KOps/s | |
test_items_stack_nested_leaf | 91.5820μs | 58.3266μs | 17.1448 KOps/s | 17.1899 KOps/s | |
test_items_stack_nested_locked | 0.6995ms | 0.3222ms | 3.1036 KOps/s | 3.0902 KOps/s | |
test_keys | 0.3757ms | 3.4712μs | 288.0824 KOps/s | 288.1452 KOps/s | |
test_keys_nested | 0.4454ms | 70.8322μs | 14.1179 KOps/s | 14.0773 KOps/s | |
test_keys_nested_locked | 2.5264ms | 76.0468μs | 13.1498 KOps/s | 13.0434 KOps/s | |
test_keys_nested_leaf | 0.4306ms | 62.0045μs | 16.1279 KOps/s | 16.0068 KOps/s | |
test_keys_stack_nested | 0.1073ms | 70.9416μs | 14.0961 KOps/s | 13.9562 KOps/s | |
test_keys_stack_nested_leaf | 0.4317ms | 61.6166μs | 16.2294 KOps/s | 16.0843 KOps/s | |
test_keys_stack_nested_locked | 0.4499ms | 75.7353μs | 13.2039 KOps/s | 12.9058 KOps/s | |
test_values | 62.7178μs | 0.8408μs | 1.1894 MOps/s | 1.1733 MOps/s | |
test_values_nested | 0.4191ms | 31.1509μs | 32.1018 KOps/s | 31.8012 KOps/s | |
test_values_nested_locked | 0.4069ms | 33.8426μs | 29.5486 KOps/s | 30.4959 KOps/s | |
test_values_nested_leaf | 0.4251ms | 33.8485μs | 29.5434 KOps/s | 29.4742 KOps/s | |
test_values_stack_nested | 64.2310μs | 31.3530μs | 31.8949 KOps/s | 31.7744 KOps/s | |
test_values_stack_nested_leaf | 0.4221ms | 34.0596μs | 29.3603 KOps/s | 29.1096 KOps/s | |
test_values_stack_nested_locked | 0.4076ms | 34.1371μs | 29.2936 KOps/s | 30.2508 KOps/s | |
test_membership | 19.0554μs | 0.5115μs | 1.9552 MOps/s | 1.9640 MOps/s | |
test_membership_nested | 0.1945ms | 1.8853μs | 530.4245 KOps/s | 536.6182 KOps/s | |
test_membership_nested_leaf | 16.2505μs | 1.8829μs | 531.0844 KOps/s | 537.5625 KOps/s | |
test_membership_stacked_nested | 32.6200μs | 1.9561μs | 511.2146 KOps/s | 512.7883 KOps/s | |
test_membership_stacked_nested_leaf | 28.5300μs | 1.9600μs | 510.2084 KOps/s | 507.5000 KOps/s | |
test_membership_nested_last | 0.4005ms | 2.7916μs | 358.2169 KOps/s | 360.0025 KOps/s | |
test_membership_nested_leaf_last | 34.7310μs | 2.7942μs | 357.8849 KOps/s | 357.9707 KOps/s | |
test_membership_stacked_nested_last | 25.5010μs | 2.8185μs | 354.7966 KOps/s | 359.0390 KOps/s | |
test_membership_stacked_nested_leaf_last | 0.4034ms | 2.7618μs | 362.0853 KOps/s | 358.2217 KOps/s | |
test_nested_getleaf | 38.8910μs | 5.9779μs | 167.2828 KOps/s | 166.8331 KOps/s | |
test_nested_get | 46.4310μs | 5.6920μs | 175.6843 KOps/s | 175.7490 KOps/s | |
test_stacked_getleaf | 41.8300μs | 5.9973μs | 166.7414 KOps/s | 167.3772 KOps/s | |
test_stacked_get | 57.0310μs | 5.6486μs | 177.0352 KOps/s | 175.5337 KOps/s | |
test_nested_getitemleaf | 30.4810μs | 6.1045μs | 163.8129 KOps/s | 164.1673 KOps/s | |
test_nested_getitem | 57.3810μs | 5.7435μs | 174.1087 KOps/s | 172.4887 KOps/s | |
test_stacked_getitemleaf | 64.9510μs | 6.0697μs | 164.7540 KOps/s | 164.2418 KOps/s | |
test_stacked_getitem | 32.0910μs | 5.7492μs | 173.9387 KOps/s | 171.8004 KOps/s | |
test_lock_nested | 6.8105ms | 0.3740ms | 2.6740 KOps/s | 2.7449 KOps/s | |
test_lock_stack_nested | 0.4830ms | 0.3365ms | 2.9716 KOps/s | 2.9971 KOps/s | |
test_unlock_nested | 0.6676ms | 0.3113ms | 3.2125 KOps/s | 3.3462 KOps/s | |
test_unlock_stack_nested | 0.3361ms | 0.2777ms | 3.6008 KOps/s | 3.6944 KOps/s | |
test_flatten_speed | 0.4514ms | 73.5657μs | 13.5933 KOps/s | 13.6404 KOps/s | |
test_unflatten_speed | 0.6687ms | 0.2942ms | 3.3989 KOps/s | 3.4419 KOps/s | |
test_common_ops | 1.7210ms | 0.6073ms | 1.6467 KOps/s | 1.6938 KOps/s | |
test_creation | 0.1412ms | 1.4795μs | 675.8919 KOps/s | 689.1809 KOps/s | |
test_creation_empty | 41.7010μs | 9.0186μs | 110.8815 KOps/s | 117.5131 KOps/s | |
test_creation_nested_1 | 39.7700μs | 10.5829μs | 94.4921 KOps/s | 99.1936 KOps/s | |
test_creation_nested_2 | 0.3930ms | 13.3258μs | 75.0423 KOps/s | 79.3159 KOps/s | |
test_clone | 49.0110μs | 9.9921μs | 100.0787 KOps/s | 99.5146 KOps/s | |
test_getitem[int] | 1.4992ms | 10.4704μs | 95.5074 KOps/s | 95.8626 KOps/s | |
test_getitem[slice_int] | 93.0076ms | 30.4645μs | 32.8250 KOps/s | 47.5370 KOps/s | |
test_getitem[range] | 0.1342ms | 37.7214μs | 26.5102 KOps/s | 26.2012 KOps/s | |
test_getitem[tuple] | 0.1122ms | 17.9184μs | 55.8085 KOps/s | 54.5995 KOps/s | |
test_getitem[list] | 0.4203ms | 33.7833μs | 29.6004 KOps/s | 29.6793 KOps/s | |
test_setitem_dim[int] | 39.3910μs | 18.5588μs | 53.8828 KOps/s | 53.4682 KOps/s | |
test_setitem_dim[slice_int] | 64.7210μs | 37.7845μs | 26.4659 KOps/s | 26.4479 KOps/s | |
test_setitem_dim[range] | 78.5510μs | 53.9796μs | 18.5255 KOps/s | 18.5316 KOps/s | |
test_setitem_dim[tuple] | 54.9320μs | 32.1305μs | 31.1230 KOps/s | 31.1684 KOps/s | |
test_setitem | 81.2410μs | 15.4345μs | 64.7901 KOps/s | 68.5058 KOps/s | |
test_set | 0.4017ms | 14.8829μs | 67.1912 KOps/s | 70.9847 KOps/s | |
test_set_shared | 1.6201ms | 0.1462ms | 6.8390 KOps/s | 6.8211 KOps/s | |
test_update | 0.2505ms | 17.9801μs | 55.6169 KOps/s | 56.8144 KOps/s | |
test_update_nested | 0.4132ms | 22.4025μs | 44.6378 KOps/s | 45.2886 KOps/s | |
test_update__nested | 1.1490ms | 24.3304μs | 41.1008 KOps/s | 43.7552 KOps/s | |
test_set_nested | 85.3120μs | 15.4645μs | 64.6640 KOps/s | 65.1606 KOps/s | |
test_set_nested_new | 0.3987ms | 17.8393μs | 56.0560 KOps/s | 55.7556 KOps/s | |
test_select | 91.1220μs | 29.7398μs | 33.6250 KOps/s | 33.4284 KOps/s | |
test_select_nested | 0.4199ms | 41.9564μs | 23.8343 KOps/s | 23.9190 KOps/s | |
test_exclude_nested | 0.4371ms | 59.9344μs | 16.6849 KOps/s | 16.8598 KOps/s | |
test_empty[True] | 0.6347ms | 0.2580ms | 3.8767 KOps/s | 3.9055 KOps/s | |
test_empty[False] | 38.4587μs | 0.7474μs | 1.3380 MOps/s | 1.3214 MOps/s | |
test_to | 85.7810μs | 59.1624μs | 16.9026 KOps/s | 16.7493 KOps/s | |
test_to_nonblocking | 97.1820μs | 53.8961μs | 18.5542 KOps/s | 19.2731 KOps/s | |
test_unbind_speed | 0.6202ms | 0.2328ms | 4.2959 KOps/s | 4.4062 KOps/s | |
test_unbind_speed_stack0 | 0.6199ms | 0.2341ms | 4.2715 KOps/s | 4.4449 KOps/s | |
test_unbind_speed_stack1 | 93.6649ms | 0.6554ms | 1.5258 KOps/s | 1.6866 KOps/s | |
test_split | 94.2928ms | 1.7478ms | 572.1594 Ops/s | 627.8630 Ops/s | |
test_chunk | 1.8531ms | 1.4823ms | 674.6465 Ops/s | 627.4971 Ops/s | |
test_consolidate[False-None] | 97.4791ms | 2.9050ms | 344.2297 Ops/s | 343.0403 Ops/s | |
test_consolidate[default-None] | 1.7627ms | 1.6660ms | 600.2418 Ops/s | 599.3785 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7661ms | 1.6974ms | 589.1345 Ops/s | 589.7977 Ops/s | |
test_consolidate_njt[False-None] | 6.8196ms | 6.7035ms | 149.1749 Ops/s | 149.5605 Ops/s | |
test_to[False-False-None] | 2.2710ms | 2.1771ms | 459.3249 Ops/s | 426.3046 Ops/s | |
test_to[True-False-None] | 1.4123ms | 1.3089ms | 763.9938 Ops/s | 763.7473 Ops/s | |
test_to[within-False-None] | 4.3429ms | 4.0856ms | 244.7626 Ops/s | 245.1965 Ops/s | |
test_to[True-default-None] | 5.3231ms | 5.1152ms | 195.4960 Ops/s | 191.9982 Ops/s | |
test_to_njt[False-False-None] | 7.9012ms | 7.7057ms | 129.7733 Ops/s | 130.3640 Ops/s | |
test_to_njt[True-False-None] | 5.8083ms | 5.6789ms | 176.0906 Ops/s | 181.0595 Ops/s | |
test_to_njt[within-False-None] | 12.6167ms | 12.3350ms | 81.0703 Ops/s | 80.5804 Ops/s | |
test_creation[device0] | 0.3736ms | 78.5096μs | 12.7373 KOps/s | 12.5485 KOps/s | |
test_creation_from_tensor | 0.4947ms | 82.9810μs | 12.0509 KOps/s | 11.9139 KOps/s | |
test_add_one[memmap_tensor0] | 0.2322ms | 6.4945μs | 153.9763 KOps/s | 148.0090 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8381μs | 0.4268μs | 2.3430 MOps/s | 2.3423 MOps/s | |
test_stack[memmap_tensor0] | 26.9900μs | 4.5408μs | 220.2247 KOps/s | 221.2852 KOps/s | |
test_memmaptd_index | 1.9596ms | 0.2485ms | 4.0246 KOps/s | 3.9147 KOps/s | |
test_memmaptd_index_astensor | 0.5946ms | 0.3081ms | 3.2457 KOps/s | 3.1816 KOps/s | |
test_memmaptd_index_op | 1.0076ms | 0.5914ms | 1.6909 KOps/s | 1.6711 KOps/s | |
test_serialize_model | 0.1321s | 0.1309s | 7.6411 Ops/s | 5.3702 Ops/s | |
test_serialize_model_pickle | 1.3661s | 1.1933s | 0.8380 Ops/s | 0.8217 Ops/s | |
test_serialize_weights | 0.1315s | 0.1303s | 7.6767 Ops/s | 7.6887 Ops/s | |
test_serialize_weights_returnearly | 0.3773s | 57.0011ms | 17.5435 Ops/s | 23.1474 Ops/s | |
test_serialize_weights_pickle | 1.3769s | 1.2199s | 0.8197 Ops/s | 0.8196 Ops/s | |
test_reshape_pytree | 53.1510μs | 22.3570μs | 44.7286 KOps/s | 43.6599 KOps/s | |
test_reshape_td | 53.4910μs | 26.7930μs | 37.3233 KOps/s | 37.0005 KOps/s | |
test_view_pytree | 47.0700μs | 22.2026μs | 45.0398 KOps/s | 44.5480 KOps/s | |
test_view_td | 59.1710μs | 30.8367μs | 32.4289 KOps/s | 32.3507 KOps/s | |
test_unbind_pytree | 65.1210μs | 27.9410μs | 35.7897 KOps/s | 35.7776 KOps/s | |
test_unbind_td | 0.6196ms | 35.7583μs | 27.9655 KOps/s | 28.4131 KOps/s | |
test_split_pytree | 62.1720μs | 30.5602μs | 32.7223 KOps/s | 32.6248 KOps/s | |
test_split_td | 0.7825ms | 38.6669μs | 25.8619 KOps/s | 25.4350 KOps/s | |
test_add_pytree | 79.0910μs | 34.0931μs | 29.3314 KOps/s | 28.9907 KOps/s | |
test_add_td | 87.1120μs | 48.1745μs | 20.7579 KOps/s | 20.3721 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1821ms | 0.1187ms | 8.4215 KOps/s | 8.1319 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2229ms | 0.1237ms | 8.0818 KOps/s | 7.9654 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2591ms | 0.1027ms | 9.7355 KOps/s | 10.0681 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.2536ms | 0.1511ms | 6.6167 KOps/s | 6.4666 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 97.7010μs | 23.8667μs | 41.8993 KOps/s | 44.3439 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 57.2710μs | 27.0348μs | 36.9894 KOps/s | 36.7568 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1009ms | 65.2364μs | 15.3289 KOps/s | 15.4281 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.2901ms | 49.8206μs | 20.0720 KOps/s | 20.5467 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1989ms | 0.1441ms | 6.9414 KOps/s | 6.9325 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2921ms | 0.2085ms | 4.7959 KOps/s | 4.8360 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1341ms | 97.7087μs | 10.2345 KOps/s | 9.7875 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1104ms | 52.0391μs | 19.2163 KOps/s | 18.2900 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2312ms | 0.1454ms | 6.8797 KOps/s | 6.8637 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5496ms | 0.4886ms | 2.0467 KOps/s | 2.0116 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3606ms | 0.2458ms | 4.0678 KOps/s | 4.0489 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1994ms | 0.1442ms | 6.9352 KOps/s | 6.9000 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1431ms | 62.8119μs | 15.9205 KOps/s | 15.8699 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1539ms | 0.1018ms | 9.8236 KOps/s | 10.2239 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4616ms | 0.4062ms | 2.4621 KOps/s | 2.3967 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2066ms | 0.1431ms | 6.9874 KOps/s | 7.1782 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 56.9510μs | 18.8315μs | 53.1024 KOps/s | 53.3782 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 54.7810μs | 28.2366μs | 35.4151 KOps/s | 36.9522 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1922ms | 69.7919μs | 14.3283 KOps/s | 14.2423 KOps/s | |
test_compile_copy_flat[pytree-eager] | 94.9220μs | 51.6583μs | 19.3580 KOps/s | 19.2467 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6758ms | 0.4548ms | 2.1985 KOps/s | 2.2104 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7276ms | 2.6458ms | 377.9582 Ops/s | 375.2026 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6392ms | 0.4438ms | 2.2535 KOps/s | 2.2437 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8184ms | 2.6770ms | 373.5482 Ops/s | 366.8481 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5357ms | 0.1128ms | 8.8672 KOps/s | 8.7468 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5674ms | 80.8592μs | 12.3672 KOps/s | 12.2902 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2698ms | 0.1060ms | 9.4376 KOps/s | 9.2579 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1116ms | 68.3143μs | 14.6382 KOps/s | 14.6622 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1608ms | 0.1068ms | 9.3635 KOps/s | 9.3701 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1970ms | 68.3021μs | 14.6408 KOps/s | 14.7161 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1509ms | 99.9882μs | 10.0012 KOps/s | 9.9962 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1376ms | 17.4049μs | 57.4552 KOps/s | 56.0181 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1520ms | 96.0379μs | 10.4126 KOps/s | 10.3160 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 53.0710μs | 15.9249μs | 62.7947 KOps/s | 63.6695 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1496ms | 97.2742μs | 10.2802 KOps/s | 10.3312 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 65.0020μs | 15.8450μs | 63.1115 KOps/s | 63.0814 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1604ms | 0.1017ms | 9.8319 KOps/s | 9.8554 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5583ms | 17.2350μs | 58.0216 KOps/s | 58.2966 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1469ms | 97.3659μs | 10.2705 KOps/s | 10.2432 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 54.5810μs | 16.0206μs | 62.4195 KOps/s | 63.7949 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1469ms | 97.4044μs | 10.2665 KOps/s | 10.3064 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1545ms | 19.2051μs | 52.0696 KOps/s | 63.9339 KOps/s | |
test_mod_add[eager] | 74.3320μs | 32.2893μs | 30.9700 KOps/s | 31.1328 KOps/s | |
test_mod_add[compile] | 0.2620ms | 80.5249μs | 12.4185 KOps/s | 12.6275 KOps/s | |
test_mod_add[compile-overhead] | 0.3158ms | 0.1639ms | 6.1012 KOps/s | 5.7740 KOps/s | |
test_mod_wrap[eager] | 0.3870ms | 0.2504ms | 3.9935 KOps/s | 3.9670 KOps/s | |
test_mod_wrap[compile] | 0.3745ms | 0.2872ms | 3.4820 KOps/s | 3.4595 KOps/s | |
test_mod_wrap[compile-overhead] | 7.9367ms | 4.0192ms | 248.8046 Ops/s | 247.2261 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6819ms | 1.4756ms | 677.7090 Ops/s | 685.3810 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5043ms | 1.3833ms | 722.9260 Ops/s | 718.2080 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.5547ms | 1.0349ms | 966.2945 Ops/s | 953.4488 Ops/s | |
test_seq_add[eager] | 0.1393ms | 0.1027ms | 9.7390 KOps/s | 10.3276 KOps/s | |
test_seq_add[compile] | 0.1556ms | 88.0453μs | 11.3578 KOps/s | 11.3594 KOps/s | |
test_seq_add[compile-overhead] | 0.1831ms | 0.1291ms | 7.7476 KOps/s | 7.7411 KOps/s | |
test_seq_wrap[eager] | 0.4657ms | 0.3884ms | 2.5749 KOps/s | 2.5343 KOps/s | |
test_seq_wrap[compile] | 0.3683ms | 0.3018ms | 3.3139 KOps/s | 3.2874 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3046ms | 0.2313ms | 4.3242 KOps/s | 4.4660 KOps/s | |
test_func_call_runtime[False-eager] | 0.8769ms | 0.8055ms | 1.2414 KOps/s | 1.3100 KOps/s | |
test_func_call_runtime[False-compile] | 0.9361ms | 0.7986ms | 1.2522 KOps/s | 1.3222 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4376ms | 0.3700ms | 2.7025 KOps/s | 2.7387 KOps/s | |
test_func_call_runtime[True-eager] | 1.0151ms | 0.9154ms | 1.0924 KOps/s | 1.0639 KOps/s | |
test_func_call_runtime[True-compile] | 0.8929ms | 0.8258ms | 1.2110 KOps/s | 1.2827 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4558ms | 0.3865ms | 2.5876 KOps/s | 2.5674 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9250ms | 0.7952ms | 1.2576 KOps/s | 1.2378 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8286ms | 0.7513ms | 1.3309 KOps/s | 1.3231 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4224ms | 0.3673ms | 2.7222 KOps/s | 2.7215 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1365ms | 1.0114ms | 988.7030 Ops/s | 974.9448 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9158ms | 0.8228ms | 1.2154 KOps/s | 1.2283 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4687ms | 0.4137ms | 2.4174 KOps/s | 2.3986 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5268ms | 2.0955ms | 477.2112 Ops/s | 469.2278 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0008ms | 0.8662ms | 1.1544 KOps/s | 1.2147 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4770ms | 0.4129ms | 2.4220 KOps/s | 2.3952 KOps/s | |
test_distributed | 0.7311ms | 0.1229ms | 8.1395 KOps/s | 8.8847 KOps/s | |
test_tdmodule | 0.5811ms | 14.8169μs | 67.4905 KOps/s | 71.3256 KOps/s | |
test_tdmodule_dispatch | 47.7310μs | 28.5914μs | 34.9756 KOps/s | 34.8897 KOps/s | |
test_tdseq | 36.9310μs | 15.7977μs | 63.3002 KOps/s | 64.1435 KOps/s | |
test_tdseq_dispatch | 54.4610μs | 31.9125μs | 31.3357 KOps/s | 31.7148 KOps/s | |
test_instantiation_functorch | 1.7591ms | 1.5710ms | 636.5536 Ops/s | 635.3100 Ops/s | |
test_exec_functorch | 0.2068ms | 0.1467ms | 6.8168 KOps/s | 6.6727 KOps/s | |
test_exec_functional_call | 0.2113ms | 0.1418ms | 7.0508 KOps/s | 6.9412 KOps/s | |
test_exec_td_decorator | 0.3860ms | 0.1908ms | 5.2422 KOps/s | 5.2862 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7562ms | 0.6804ms | 1.4697 KOps/s | 1.4546 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8568ms | 0.6845ms | 1.4610 KOps/s | 1.4540 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7134ms | 0.6082ms | 1.6443 KOps/s | 1.6596 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7862ms | 0.6179ms | 1.6183 KOps/s | 1.6550 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.8708ms | 19.5362ms | 51.1870 Ops/s | 50.9081 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.2563ms | 19.6062ms | 51.0043 Ops/s | 50.8213 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 20.3657ms | 19.5295ms | 51.2045 Ops/s | 51.2125 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.2404ms | 19.5785ms | 51.0765 Ops/s | 51.1461 Ops/s | |
test_to_module_speed[True] | 1.0870ms | 0.9415ms | 1.0621 KOps/s | 1.0376 KOps/s | |
test_to_module_speed[False] | 1.3252ms | 0.9345ms | 1.0701 KOps/s | 1.0662 KOps/s | |
test_tc_init | 0.1677ms | 34.5911μs | 28.9091 KOps/s | 27.1921 KOps/s | |
test_tc_init_nested | 0.1591ms | 69.9367μs | 14.2986 KOps/s | 13.3493 KOps/s | |
test_tc_first_layer_tensor | 4.5987μs | 0.7034μs | 1.4217 MOps/s | 1.4391 MOps/s | |
test_tc_first_layer_nontensor | 21.7310μs | 2.3045μs | 433.9289 KOps/s | 427.7962 KOps/s | |
test_tc_second_layer_tensor | 17.9002μs | 1.4124μs | 708.0391 KOps/s | 699.1994 KOps/s | |
test_tc_second_layer_nontensor | 31.5110μs | 3.0366μs | 329.3179 KOps/s | 326.5097 KOps/s | |
test_unbind | 0.2383s | 10.0503ms | 99.4996 Ops/s | 149.9449 Ops/s | |
test_full_like | 12.0698ms | 9.1016ms | 109.8706 Ops/s | 108.6651 Ops/s | |
test_zeros_like | 9.1259ms | 7.1341ms | 140.1711 Ops/s | 137.9878 Ops/s | |
test_ones_like | 5.2220ms | 4.3115ms | 231.9386 Ops/s | 232.4602 Ops/s | |
test_clone | 6.6039ms | 6.2992ms | 158.7496 Ops/s | 158.9083 Ops/s | |
test_squeeze | 66.9510μs | 9.4838μs | 105.4427 KOps/s | 106.9402 KOps/s | |
test_unsqueeze | 0.1533ms | 70.9662μs | 14.0912 KOps/s | 14.0220 KOps/s | |
test_split | 0.3918ms | 0.1566ms | 6.3869 KOps/s | 6.3047 KOps/s | |
test_permute | 0.2252ms | 0.1796ms | 5.5677 KOps/s | 5.6246 KOps/s | |
test_stack | 51.8107ms | 50.5693ms | 19.7749 Ops/s | 19.8849 Ops/s | |
test_cat | 50.5184ms | 50.2491ms | 19.9009 Ops/s | 19.9632 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):