Why the input of model is speed? #26

JackokieZhao · 2022-12-08T14:35:00Z

Hi, thanks for the open source of your greate work. There remains a detail that I cannot understand, so I would appreciate it if you can explain it to be.
In my opinion, the line 260 in file 'baselineutils.py' means that the [0-2] columns in dim 2 denotes pedestrain positions while [2:4] columns in dim 2 denotes the speed of pedestrain.

inp_norm=np.concatenate((inp_te_np,inp_speed),2)
However, the input of model are [2:4] in dim 2, which means the input are speeds rather than positions of pedestrain.
inp=(batch['src'][:,1:,2:4].to(device)-mean.to(device))/std.to(device)
pred=model(inp, dec_inp, src_att, trg_att)
I don't know whether there is any mistick for the unserstanding, could you please expalin it to me ?

The text was updated successfully, but these errors were encountered:

BYZANTINE26 · 2022-12-21T03:46:38Z

Hi @JackokieZhao, you are getting it correct, the model is taking the speed as input and I guess it's not a mistake, I experimented with both cases positions and speeds and I found model actually worked better with speeds, and you can look at it this way - speed is just distance when looked for unit time.

I hope I'm able to resolve your doubt.

JackokieZhao · 2022-12-21T04:53:39Z

Hi @JackokieZhao, you are getting it correct, the model is taking the speed as input and I guess it's not a mistake, I experimented with both cases positions and speeds and I found model actually worked better with speeds, and you can look at it this way - speed is just distance when looked for unit time.

I hope I'm able to resolve your doubt.

Thanks for your answer !
But, if this is the case, I think there are some details on the metrics need to be further determine.

In my opinion, the ADE of the positions dosen't equal the ADE of speed.

We assume there are three positions p1, p2, p3, and corresponding speed v1, v2 (prediction results p1_, p2_, p3_, v1_, v2_).

In general, the ADE of positions should be mean(|p1 - p1_| + |p2 - p2_| + |p3 - p3_|).
However, the speed ADE are mean(|v1 - v1_| + |v2 - v2_|).
For the first speed error, it corresponds the position ADE: |v1 - v1_| == |p2 - p2_|.
What's import is that |v2 - v2_| dose not corresponding to |p3 - p3_|, because it still contains the prediction error of v1_.
In other words, we will acquire the prediction of
p3_ == p1 + v1_ * t + v2_ * t
if we transform the speed prediction to the trajectory prediction.
Here, the position errors are
|p3 - p3_| = |p3 - (p1 + v1_ * t + v2_ * t)| = |v1 * t + v2 * t - v1_ * t - v2_ * t| = |(v1 - v1_) + (v2 - v2)| * t.
Therefore, in my opinion, the metrics of speed ADE and FDE are not equal to positions ADE and FDE.

And, I think this is the reason why the performance of position prediction is much lower than speed prediction.

This is just a academic talk, and I am not aggressive or hostile in any way.
And, there might some mistakes for my understanding.

Thanks a lot!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why the input of model is speed? #26

Why the input of model is speed? #26

JackokieZhao commented Dec 8, 2022 •

edited

Loading

BYZANTINE26 commented Dec 21, 2022

JackokieZhao commented Dec 21, 2022 •

edited

Loading

Why the input of model is speed? #26

Why the input of model is speed? #26

Comments

JackokieZhao commented Dec 8, 2022 • edited Loading

BYZANTINE26 commented Dec 21, 2022

JackokieZhao commented Dec 21, 2022 • edited Loading

JackokieZhao commented Dec 8, 2022 •

edited

Loading

JackokieZhao commented Dec 21, 2022 •

edited

Loading