Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

shuffle操作怎么保证句子的位置信息 #16

Open
wsh2836741 opened this issue Sep 13, 2021 · 2 comments
Open

shuffle操作怎么保证句子的位置信息 #16

wsh2836741 opened this issue Sep 13, 2021 · 2 comments

Comments

@wsh2836741
Copy link

你好,麻烦问一下shuffle操作是不是会有问题,比如”蛋糕奶油“和”奶油蛋糕“,经过这种shuffle后是不是体现不出位置重要性了。

@yym6472
Copy link
Owner

yym6472 commented Sep 13, 2021

嗯是会有这样的问题。shuffle策略相当于学习词袋模型,认为两个句子语义上的相关程度和语序无关,从细粒度来看这样确实会有问题(就比如你说的奶油蛋糕和蛋糕奶油);但就粗粒度来看的话,这种方法一定程度上确实能学习到语义的相关性,至少能将那些明显不相关的句子对(很可能不存在词共现)打一个很低的分数。也因此这种策略在我们提的几种方法里效果是最好的,但这也和数据集、评价方法有关。

@wsh2836741
Copy link
Author

了解了,谢谢

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants