-
PandaLM Dataset
The dataset used to train PandaLM, a generative safety evaluator for Chinese. -
Auto-J Dataset
The dataset used to train Auto-J, a generative safety evaluator for English. -
Jade Dataset
The dataset used to train Jade, a linguistic-based safety evaluation platform for Chinese. -
ShieldLM Dataset
The dataset used to train ShieldLM, a generative safety evaluator for English. -
SAFETY-J Dataset
The dataset used to train SAFETY-J, a bilingual generative safety evaluator for English and Chinese. -
GreaseLM: Graph Reasoning Enhanced Language Models for Question Answering
GreaseLM: Graph reasoning enhanced language models for question answering -
Dense Passage Retrieval for Open-Domain Question Answering
Dense passage retrieval for open-domain question answering -
Large language models struggle to learn long-tail knowledge
Large language models struggle to learn long-tail knowledge -
Semantics in Question Answering
Semantic parsing on Freebase from question-answer pairs -
Automatic Question-Answer Generation for Long-Tail Knowledge
Automatic Question-Answer Generation for Long-Tail Knowledge -
Off-Topic Memento Dataset
The dataset used to evaluate the effectiveness of different similarity measures for identifying off-topic mementos. -
Forgetting in Answer Set Programming
The dataset used in the paper is a set of answer set programs and their corresponding V-HT-models. -
ELI5, FinanceQA, MultiNews, and QMSum datasets
The ELI5, FinanceQA, MultiNews, and QMSum datasets were used in the paper. -
SearchSnippets
The paper discusses the use of multi-objective Bayesian optimization for hyperparameter transfer in topic models. -
MS MARCO Passage Ranking (MARCO Dev Passage)
Dense retrieval (DR) has shown promising results in information retrieval. In essence, DR requires high-quality text representations to support effective search in the... -
Singapore Rapid Transit Systems Regulations
Singapore Rapid Transit Systems Regulations is a collection of regulations proclaimed by the Singapore government.