You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the code snippet above, I notice that the value of cur_image_idx doesn't change within a single batch. This implies that cur_image_features remain identical for images within the same batch, which seems unusual. Could you confirm if this is the intended behavior?
Another point of confusion I have pertains to the line for j in range(5): and the expression j*16. Based on the settings used in the Resampler, I would expect the image_features to have dimensions [batch_size*8, 64, 5120]. Can you clarify why the image features are selected using for j in range(5): and j*16?
The text was updated successfully, but these errors were encountered:
LLaVA-UHD/llava_uhd/train/llava-uhd/adapt_llava.py
Lines 169 to 173 in 69e75d0
In the code snippet above, I notice that the value of
cur_image_idx
doesn't change within a single batch. This implies thatcur_image_features
remain identical for images within the same batch, which seems unusual. Could you confirm if this is the intended behavior?Another point of confusion I have pertains to the line
for j in range(5):
and the expressionj*16
. Based on the settings used in the Resampler, I would expect theimage_features
to have dimensions [batch_size*8, 64, 5120]. Can you clarify why the image features are selected usingfor j in range(5):
andj*16
?The text was updated successfully, but these errors were encountered: