1 dataset found

Tags: vision-language tasks

  • InternLM2

    InternLM2 is a large vision-language model that supports images of any aspect ratio at resolutions from 336 pixels up to 4K HD, which makes it practical to deploy in real-world settings.
You can also access this registry using the API (see API Docs).