The IBRNet dataset contains 10 scenes, each with 60 training views (with the object to be removed), 40 test views (without the object), and human-annotated object masks per view.
The SPIn-NeRF dataset contains 10 scenes, each with 60 training views (with the object to be removed), 40 test views (without the object), and human-annotated object masks per...