Where can I download the SNLI corpus?

The SNLI corpus can be downloaded from the SNLI project page at https://nlp.stanford.edu/projects/snli/.

SNLI | Find AI List

Overview

The Stanford Natural Language Inference (SNLI) Corpus is a collection of 570k human-written English sentence pairs, manually labeled for balanced classification with the labels entailment, contradiction, and neutral. It serves as a benchmark for evaluating representational systems for text, including those induced by representation-learning methods, and as a resource for developing NLP models. The corpus is used for Natural Language Inference (NLI), also known as Recognizing Textual Entailment (RTE), which is the task of determining the inference relation between two texts. SNLI is distributed in both JSON lines and tab separated value files. Researchers and developers in natural language processing and machine learning use it to train and evaluate models for tasks such as text understanding and semantic reasoning. The corpus includes content from the Flickr 30k and VisualGenome corpora.

Common tasks

Training NLI models Evaluating text representation systems Developing NLP models Benchmarking semantic reasoning capabilities Analyzing sentence relationships Building text understanding systems Researching inference relations between texts

FAQ

View all

What is the SNLI corpus?

The SNLI corpus is a collection of 570k human-written English sentence pairs labeled for entailment, contradiction, and neutral relations.

What is Natural Language Inference (NLI)?

NLI, also known as Recognizing Textual Entailment (RTE), is the task of determining the inference relation between two texts.

How is the SNLI corpus licensed?

The Stanford Natural Language Inference Corpus is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

In which formats is the SNLI corpus distributed?

The corpus is distributed in both JSON lines and tab-separated value files.

FAQ+

What is the SNLI corpus?

The SNLI corpus is a collection of 570k human-written English sentence pairs labeled for entailment, contradiction, and neutral relations.

What is Natural Language Inference (NLI)?

NLI, also known as Recognizing Textual Entailment (RTE), is the task of determining the inference relation between two texts.

How is the SNLI corpus licensed?

The Stanford Natural Language Inference Corpus is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

In which formats is the SNLI corpus distributed?

The corpus is distributed in both JSON lines and tab-separated value files.

View all

Compare with top alternatives

Full compare

Tool	Pricing	Rating	Visits
SNLICurrent	Free	-	-
HellaSwag	Free	★ 0.0	-
Cityscapes Dataset	Free	★ 0.0	-
KITTI Dataset	Free	★ 0.0	-

SNLI

Current

Pricing: Free
Rating: -
Visits: -

HellaSwag

Pricing: Free
Rating: ★ 0.0
Visits: -

Cityscapes Dataset

Pricing: Free
Rating: ★ 0.0
Visits: -

KITTI Dataset

Pricing: Free
Rating: ★ 0.0
Visits: -

SNLI

Should you use SNLI?

Overview

FAQ

Pricing

Pros & Cons

Compare with top alternatives

More tools from Nlp

Reviews & Ratings