6 datasets found

Tags: vision-language model

Filter Results
  • InternLM-XC

    InternLM-XC is a vision-language large model that supports images with any aspect ratio from 336 pixels up to 4K HD, facilitating its deployment in real-world contexts.
  • InternLM2

    InternLM2 is a vision-language large model that supports images with any aspect ratio from 336 pixels up to 4K HD, facilitating its deployment in real-world contexts.
  • InternLM-XComposer2

    InternLM-XComposer2 is a vision-language large model that supports images with any aspect ratio from 336 pixels up to 4K HD, facilitating its deployment in real-world contexts.
  • InternLM-XComposer2-4KHD

    InternLM-XComposer2-4KHD is a vision-language large model that supports images with any aspect ratio from 336 pixels up to 4K HD, facilitating its deployment in real-world...
  • DebiasVL

    The dataset used in this paper is a vision-language model, specifically debiasVL.
  • Mmicl

    Mmicl: Empowering vision-language model with multi-modal in-context learning
You can also access this registry using the API (see API Docs).