Cite this as

Adam Botach, Evgenii Zheltonozhskii, Chaim Baskin (2024). Dataset: End-to-End Referring Video Object Segmentation with Multimodal Transformers. Resource: Original Metadata. https://doi.org/10.57702/48kufbtr

DOI retrieved: December 2, 2024

Additional Information

Field Value
Created December 2, 2024
Last updated December 2, 2024
Format JSON