Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于在VIG中使用超像素作为patch输入 #241

Open
TonyF815 opened this issue Jan 31, 2024 · 2 comments
Open

关于在VIG中使用超像素作为patch输入 #241

TonyF815 opened this issue Jan 31, 2024 · 2 comments

Comments

@TonyF815
Copy link

大佬,我想请教一下有没有方法可以将超像素分割的结果作为Patch输入到VIG模型中,以获取更好的的语义信息,更好地完成分类任务?因为超像素的结果都不是标准尺寸的,我想知道有没有可能将不同尺寸的patch转换成相同大小的embedding?

@iamhankai
Copy link
Member

非常赞同,我最开始就想用超像素来做;但是受限于超像素的尺寸不标准,没有想到简洁的方法能去让他们标准,就暂时放弃了。如果你能搞定,那就太强了!

@Joazs
Copy link

Joazs commented Feb 8, 2024

将超像素内的每个像素点的特征做平均,得到一个特征向量,这个特征向量代表这个超像素patch的特征。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants