-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about L2 computation #47
Comments
Please refer to this issue. |
Thanks a lot for the clarification. I don't know how I missed that issue... sorry about that. I now see that you define the metric similarly to ST-P3. However, upon digging into the code of UniAD they are not conforming to the definition from SP-T3, which they have acknowledged here.
Here, Because of this, the comparison between your method and UniAD is misleading (as VAD's numbers use the more lenient metric definition while UniAD numbers are presented in the same table but using a different metric definition). It would decrease the confusion if you would add their performance when using your (and ST-P3's original) metric. Here are their displacement values when using your (and ST-P3) metric definition:
I will post these results on their GitHub as well in case they want to update their numbers (or show them in conjunction) FYI, to comply with your metric definition we simply changed the code above to
PS. Please let us know if you think we've missed something and wrongly computed UniADs performance with your metric. |
Hi, and thank you for your work.
When reviewing your evaluation code I find that your computation of the L2 displacement error (here) is computed using the average displacement error up to and including that particular time. This differs from how previous works (e.g., UniAD and ST-P3) have defined the metric. They instead compute the metric as the L2 norm at that particular timestep (see here and here)
I might have misunderstood your code and if so please let me know... but if not could you provide the numbers using the metric definition used in ST-P3 and UniAD? This would make them more easily comparable.
Can you please shed some light on this? Which of the two definitions is considered correct? (might very well be that UniAD and ST-P3 have defined the metric wrong).
Thanks,
The text was updated successfully, but these errors were encountered: