candidate = news_words[all_train_pn[i]] :IndexError: index 29998 is out of bounds for axis 0 with size 51 #6

Open
Aliang-CN opened this issue Dec 13, 2019 · 9 comments

Comments

@Aliang-CN

File "E:/project/IJCAI2019-NAML-master/naml.py", line 278, in generate_batch_data_train
candidate = news_words[all_train_pn[i]] # new[]
IndexError: index 29998 is out of bounds for axis 0 with size 51

Hi,
I think news_words holds the words of each title, while all_train_pn holds news IDs, so indexing news_words with those IDs raises the error.

@lineshen199466

Clearly, the sample data is limited...

@Aliang-CN
Author

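I looked at how the source code handles this: all_train_pn collects the encoded title IDs. The two CSV files actually hold different records: one stores the title IDs after encoding, and the other stores the titles before encoding. If anyone knows of a good dataset, please let me know; the dataset is the only thing I am missing. The sample data provided by the author is very hard to get running.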
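A quick way to confirm this kind of mismatch is to list the IDs that cannot index the word table before the batch generator runs. The sketch below is only an illustration: `find_out_of_range_ids` is a hypothetical helper, and the two lists are small stand-ins that mirror the sizes in the traceback, not the repository's real data.

```python
def find_out_of_range_ids(ids, table_size):
    """Return the sorted, de-duplicated IDs that cannot index a table with table_size rows."""
    return sorted({i for i in ids if not 0 <= i < table_size})


# Hypothetical stand-ins for the arrays used in naml.py.
news_words = [[0] * 30 for _ in range(51)]  # 51 news items, 30 word IDs each
all_train_pn = [3, 17, 29998, 50, 29998]    # news IDs referenced by the training samples

bad_ids = find_out_of_range_ids(all_train_pn, len(news_words))
print(bad_ids)
```

A non-empty result would suggest that the training file references news items that are absent from the sample news file, which matches the "size 51" in the error message.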

@Aliang-CN Aliang-CN reopened this Dec 16, 2019
@frnk823

frnk823 commented Nov 16, 2020

Did you ever manage to solve this problem?

@Aliang-CN
Author

I think I solved it: I rewrote the code to use my own dataset.

@frnk823

frnk823 commented Nov 17, 2020

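Do you still have your dataset and code? If it is convenient, I would like to try them. I got the code running on my side, but some news items have no click data, so the results are hard to validate.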

@Aliang-CN
Author

Sorry, that was my company's code.

@kayaklin

Hello, did you manage to solve this problem? I don't know how to find a suitable dataset.

@Aliang-CN
Author

That was a long time ago, and I don't really remember.
