New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

add preprocess for BigST #198

Closed

superarthurlx wants to merge 49 commits into GestaltCogTeam:master from superarthurlx:master

Contributor

superarthurlx commented Nov 28, 2024

对应BigST里面的--use_long参数，感觉单独作为一个模型的复现比较合理，之后BigST需要读取BigSTPreprocess保存的下来的参数（之后有时间再处理一下BigST的代码），源代码使用的是最后一个epoch的，按照BasicST的框架应该可以选择best_val的或者last epoch的，不知道对于BigST的效果有何影响。另外preprocess这部分代码跑起来确实非常慢。

superarthurlx and others added 30 commits

October 14, 2024 13:54


          'crossgnn'

7fee6c8


          Merge branch 'GestaltCogTeam:master' into master

ae73e32


          'ad-bigst'

4e4f331


          'ad-bigst'

e410942


          'update-bigst'

18c5e78


          fix: 🐛 a bug in tsformer runner

b8300a6


          'solve-bigst-conflict'

62d5198


          fix: 🐛 a bug in TSFormer runner

19ef3fa


          Merge branch 'GestaltCogTeam:master' into master

6aa6a1a


          'add-fredformer'

12b17e4


          add baseline fredformer (GestaltCogTeam#163)

90a9eb3

* 'add-fredformer'

---------

Co-authored-by: Yisong Fu <[email protected]>


          Merge branch 'GestaltCogTeam:master' into master

4e2c4ad


          Merge branch 'GestaltCogTeam:master' into master

ec4fad6


          Update base_epoch_runner.py

33bdb09


          Merge branch 'GestaltCogTeam:master' into master

b502384


          Merge branch 'GestaltCogTeam:master' into master

801028f


          Merge branch 'GestaltCogTeam:master' into master

fbcd6bd


          Merge branch 'GestaltCogTeam:master' into master

2df340f


          Merge branch 'GestaltCogTeam:master' into master

7554c5f


          Merge branch 'GestaltCogTeam:master' into master

733f12a


          'first-add-CycleNet'

cf0af17


          Merge branch 'GestaltCogTeam:master' into master

1938eb8


          'first-add-dlinear-glaff'

3e3376a


          'update-dlinear-glaff'

4645d51


          Merge branch 'GestaltCogTeam:master' into master

4d31abf


          Merge branch 'GestaltCogTeam:master' into master

5eca99a


          'update-scripts-weather'

ef076c3


          'update-dlinear-glaff

d348b0e


          'update-dlinear-glaff


          Merge branch 'GestaltCogTeam:master' into master

4eaf477

superarthurlx and others added 19 commits

November 11, 2024 17:52


          'add-sumba'

8ec1cd5


          'del'

6e28698


          'add'

43da595


          'add-sumba'

f08ced2


          'update-sumba'

f690d63


          Update generate_training_data.py

ec59180

Remove seed in the regular setting.


          Merge branch 'GestaltCogTeam:master' into master

01b2d3e


          'add-cats'

b1f434d


          Merge branch 'GestaltCogTeam:master' into master

c8be4c9


          Merge branch 'GestaltCogTeam:master' into master

c94a193


          Merge branch 'GestaltCogTeam:master' into master

4db1ff8


          Merge branch 'GestaltCogTeam:master' into master


          'add-SOFTS'

c257af7


          Merge branch 'GestaltCogTeam:master' into master

7d75598


          Merge branch 'GestaltCogTeam:master' into master

fb50f05


          Merge branch 'GestaltCogTeam:master' into master

b9ea1d6


          Merge branch 'GestaltCogTeam:master' into master

ab43ccd


          Merge branch 'GestaltCogTeam:master' into master

0d7b1ed


          'addBigSTPreprocess'

7deda9f

duyifanict requested a review from ChengqingYu

November 28, 2024 08:40

ChengqingYu commented Nov 29, 2024

对应BigST里面的--use_long参数，感觉单独作为一个模型的复现比较合理，之后BigST需要读取BigSTPreprocess保存的下来的参数（之后有时间再处理一下BigST的代码），源代码使用的是最后一个epoch的，按照BasicST的框架应该可以选择best_val的或者last epoch的，不知道对于BigST的效果有何影响。另外preprocess这部分代码跑起来确实非常慢。

你好，我在PEMS04上测试这个模型，如果batch size设置为64，就会产生以下错误，设置为1就是正常的，请问这有问题吗？

x_ = x[:, :, idx_perm[j*self.tiny_batch_size:(j+1)*self.tiny_batch_size], :]
IndexError: index 18800 is out of bounds for dimension 0 with size 307

Contributor Author

superarthurlx commented Nov 29, 2024

没有问题，BigST的preprocess部分需要保证batch-size是1，具体可以看源代码中BigST中preprocess的部分或者BigSTPreprocess自己的runner：

B, T, N, F = x.shape
batch_num = int(B * N / self.tiny_batch_size) # 似乎要确保不能等于0
idx_perm = np.random.permutation([i for i in range(B*N)])
for j in range(batch_num):
if j==batch_num-1:
x_ = x[:, :, idx_perm[(j+1)*self.tiny_batch_size:], :]
y_ = y[:, :, idx_perm[(j+1)self.tiny_batch_size:], :]
else:
x_ = x[:, :, idx_perm[jself.tiny_batch_size:(j+1)self.tiny_batch_size], :]
y_ = y[:, :, idx_perm[jself.tiny_batch_size:(j+1)*self.tiny_batch_size], :]

这里是从全部节点中选择一部分节点，因此idx_perm的数值不能超过节点个数N，因此需要B=1，除此之外batch_num也不能是零不然也会报错，基本上tiny_batch_size不超过N就行，我理解的是这样。

ChengqingYu commented Nov 29, 2024

还有一个问题，我测试了一下

没有问题，BigST的preprocess部分需要保证batch-size是1，具体可以看源代码中BigST中preprocess的部分或者BigSTPreprocess自己的runner：

B, T, N, F = x.shape batch_num = int(B * N / self.tiny_batch_size) # 似乎要确保不能等于0 idx_perm = np.random.permutation([i for i in range(B*N)]) for j in range(batch_num): if j==batch_num-1: x_ = x[:, :, idx_perm[(j+1)*self.tiny_batch_size:], :] y_ = y[:, :, idx_perm[(j+1)self.tiny_batch_size:], :] else: x = x[:, :, idx_perm[j_self.tiny_batch_size:(j+1)self.tiny_batch_size], :] y = y[:, :, idx_perm[j_self.tiny_batch_size:(j+1)*self.tiny_batch_size], :]

这里是从全部节点中选择一部分节点，因此idx_perm的数值不能超过节点个数N，因此需要B=1，除此之外batch_num也不能是零不然也会报错，基本上tiny_batch_size不超过N就行，我理解的是这样。

你好，我还有一个问题，我在PEMS04上测试模型的性能，同样的设置下，没有加preprocess误差要低一些（性能更好），请问你测试过性能

Contributor Author

superarthurlx commented Nov 29, 2024

是用BigST加上了preprocess学到的feat之后的测试吗？没有调参的情况下我测试过BigST在12-12实验设置下的结果，表现一般，加上preprocess没有测过。

ChengqingYu commented Nov 29, 2024

是用BigST加上了preprocess学到的feat之后的测试吗？没有调参的情况下我测试过BigST在12-12实验设置下的结果，表现一般，加上preprocess没有测过。

你好，我用BigST和BigSTpreprocess同时在PEMS04数据上测试，保持超参数一致，batch size都是1的情况下，用12预测12
BigST的mae是19多
BigSTpreprocess的mae有22到23

superarthurlx closed this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet