-
Linguistically Conditioned Semantic Textual Similarity
Semantic textual similarity (STS) is a fundamental NLP task that measures the semantic similarity between a pair of sentences. In order to reduce the inherent ambiguity posed... -
A Novel Design of Hidden Web Crawler using Ontology
A novel design of ontology based adaptive hidden web crawler has been proposed that downloads the Hidden web data by using ontology driven approach. -
Option Comparison Network for Multiple-choice Reading Comprehension
Multiple-choice reading comprehension (MCRC) is the task of selecting the correct answer from multiple options given a question and an article. The authors propose an option... -
GSM8K dataset
The dataset used in the paper is a set of problems for testing the safety of artificial general intelligence (AGI) systems. -
Synthetic arithmetic questions dataset
The dataset used in the paper is a synthetic arithmetic questions dataset, constructed following Liu & Low (2023). The input numbers are randomly generated, ensuring a very... -
Fairness-guided few-shot prompting for large language models
This survey concentrates on few-shots In-Context Learning (ICL) using retrieved examples for large language models, a key aspect of Retrieval-Augmented Generation (RAG). -
Z-icl: Zero-shot in-context learning with pseudo-demonstrations
This survey concentrates on few-shots In-Context Learning (ICL) using retrieved examples for large language models, a key aspect of Retrieval-Augmented Generation (RAG). -
Weakly-supervised visual-retriever-reader for knowledge-based question answering
This survey concentrates on few-shots In-Context Learning (ICL) using retrieved examples for large language models, a key aspect of Retrieval-Augmented Generation (RAG). -
LC-QuAD 2.0
The dataset used for training and testing the MST5 model, which consists of natural language questions paired with their corresponding Wikidata SPARQL queries. -
IIRC: A dataset of incomplete information reading comprehension questions
A dataset for information-seeking questions that require retrieving the necessary information missing from the original context. -
Visconde: Multi-document QA with GPT-3 and Neural Reranking
A multi-document QA system that uses a three-step pipeline to perform the task: decompose, retrieve, and aggregate. -
WebQSPEL and GraphQEL
Entity linking (EL), the task of identifying entities and mapping them to the correct entries in a database, is crucial for analyzing factoid questions and for building robust... -
DocumentQA
The DocumentQA dataset is a benchmark for question answering research. It consists of questions answerable using TF-IDF for paragraph selection. -
Simple SQL queries and mathematical questions
The dataset used in this paper is a collection of simple and complex SQL queries, as well as mathematical questions. The dataset is used to evaluate the task planning and tool...