2 datasets found

Tags: vision-language inference

Filter Results
  • VIS4ION

    The VIS4ION dataset is a smart wearable that helps people with blindness and low vision in their daily challenges. It provides multiple microservices based on artificial...
  • InstructBLIP

    The InstructBLIP dataset is a vision-language model for comprehensive scene understanding and textual descriptions.
You can also access this registry using the API (see API Docs).