-
SymFormer: End-to-end symbolic regression using transformer-based architecture
The dataset used in the paper is a large collection of mathematical formulas, consisting of hundreds of millions of formulas. -
FIT-AR: Far-reaching Interleaved Transformers for Autoregressive Modeling
We introduce FIT-AR, a variant of FIT that incorporates causal masks and shifting in cross-attention. -
FIT: Far-reaching Interleaved Transformers
We present FIT: a transformer-based architecture with efficient self-attention and adaptive computation.