Data Level Lottery Ticket Hypothesis for Vision Transformers

The conventional lottery ticket hypothesis (LTH) claims that there exists a sparse subnetwork within a dense neural network and a proper random initial-ization method called the winning ticket, such that it can be trained from scratch to almost as good as the dense counterpart.

BibTex: