Penn Tree Bank

doi:10.57702/l0jnm3fd

Organization

No Organization

License

No License Provided

The Penn Tree Bank dataset is a corpus split into a training set of 929k words, a validation set of 73k words, and a test set of 82k words. The vocabulary has 10k words. The dataset is used for word-level language modeling.
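
The split sizes and vocabulary size quoted above can be checked with a short script. The following is a minimal sketch, not part of the dataset record: it assumes each split is a plain-text file tokenized by whitespace, and the helper names `corpus_stats` and `summarize`, as well as any file paths passed in, are hypothetical. Exact counts may differ slightly depending on conventions such as whether an end-of-sentence token is added per line.

```python
from collections import Counter
from pathlib import Path


def corpus_stats(text: str) -> tuple[int, Counter]:
    """Return (token count, word frequencies) for whitespace-tokenized text."""
    tokens = text.split()
    return len(tokens), Counter(tokens)


def summarize(paths: dict[str, Path]) -> None:
    """Print the token count of each split and the combined vocabulary size.

    `paths` maps a split name (e.g. "train") to a text file; the file
    locations are an assumption and must be supplied by the caller.
    """
    vocab: Counter = Counter()
    for name, path in paths.items():
        n_tokens, counts = corpus_stats(path.read_text(encoding="utf-8"))
        vocab.update(counts)
        print(f"{name}: {n_tokens} tokens")
    print(f"vocabulary: {len(vocab)} distinct words")
```

Calling `summarize` with the three split files should report totals close to the 929k / 73k / 82k words and the 10k-word vocabulary stated in the description, if the files follow the assumed format.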
