You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since I tried audio-only model first and then tried audio-visual model later, the functions were not written together. I think the idea to combine them is also applicable, just crop the segment of video first and use different functions to handle audio and image parts.
subj
The text was updated successfully, but these errors were encountered: