
Easy way to probe result examples? #11

Open
chancharikmitra opened this issue Sep 22, 2023 · 1 comment

Comments


chancharikmitra commented Sep 22, 2023

Hello! This is some really interesting work!

This is more of a question. Do you have any detailed, per-example results for InstructBLIP and InstructBLIP Vicuna? If those results were available, I could inspect them directly instead of rerunning the models on the benchmark. I would like to probe the success and failure cases in more detail (the example, the model's response, etc.).

Thanks!

@Bohao-Lee
Collaborator

Thank you for your interest in our work, and we apologize for the delayed response. We have released the GPT-4V evaluation results for SEED-Bench-1 and SEED-Bench-2, which can be found at GPT-4V for SEED-Bench-1 and GPT-4V for SEED-Bench-2. If you're interested, please feel free to explore these results.
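For anyone who wants to probe individual success and failure cases from a released result file, a minimal sketch like the one below may help. It assumes the results are stored as a JSON list in which each record carries fields such as question_id, question, choices, prediction, and answer; the file name and every field name here are assumptions, so adjust them to match the actual release format.

```python
import json

# Hypothetical path and field names -- adjust to match the released result files.
RESULTS_PATH = "results.json"

with open(RESULTS_PATH, "r", encoding="utf-8") as f:
    records = json.load(f)

# Split records into successes and failures by comparing the model's
# prediction against the ground-truth answer (field names are assumptions).
failures = [r for r in records if r.get("prediction") != r.get("answer")]
successes = [r for r in records if r.get("prediction") == r.get("answer")]

print(f"{len(successes)} correct / {len(failures)} incorrect out of {len(records)}")

# Inspect a few failure cases: question, choices, model response, and answer.
for r in failures[:5]:
    print("question_id:", r.get("question_id"))
    print("question:   ", r.get("question"))
    print("choices:    ", r.get("choices"))
    print("prediction: ", r.get("prediction"))
    print("answer:     ", r.get("answer"))
    print("-" * 40)
```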
