EndoVis-18-VQLA
EndoVis-18-VQLA dataset is a public dataset with 14 video sequences on robotics surgery procedures. It is combined with the bounding box on tissue-instrument interaction detection tasks and the question-answer pairs from surgical VQA classification tasks.
BibTex: