Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Poor performance when using videos from other datasets #33

Open
JSHZT opened this issue Aug 5, 2024 · 2 comments
Open

Poor performance when using videos from other datasets #33

JSHZT opened this issue Aug 5, 2024 · 2 comments

Comments

@JSHZT
Copy link

JSHZT commented Aug 5, 2024

I used the video from vgg-sound, I used the default question, but I found that the answer from minigpt4-video has nothing to do with the video.

@KerolosAtef
Copy link
Collaborator

Thanks for your feedback
I discovered a bug related to hallucinations in MiniGPT4-video yesterday. It seems to be connected to the PEFT library. I was initially using PEFT 0.2.0, but after upgrading, the function prepare_model_for_int8_training was deprecated. When I switched to prepare_model_for_kbit_training, a significant increase in hallucinations occurred.
Keep this in mind to ensure accurate performance.
It solved in the current version

@JSHZT
Copy link
Author

JSHZT commented Aug 22, 2024

I use question:"Please describe the content of the video only in the following format: 'This video describes [video content], where [subject] appears doing [actions] in [setting/scenery].' Do not provide any additional information or explanations." but the result still cannot get the correct video information. The video I input is a video from the vgg_sound dataset, which is 10 seconds long. Are there any other good usage suggestions?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants