-
EnterpriseEM
A dataset comprising a diverse range of internal Infosys Ltd. data, including technical course contents, internal knowledge base articles, standard operating procedures for... -
Query Based Event Extraction Along a Timeline
Query based event extraction along a timeline. -
Leveraging Learning to Rank in an Optimization Framework for Timeline Summari...
Leveraging learning to rank in an optimization framework for timeline summarization. -
Timeline Summarization from Relevant Headlines
Timeline summarization from relevant headlines. -
20NewsGroups
The dataset used in this paper is a collection of documents from various domains, including news, articles, and emails. -
ClueWeb09 dataset
The ClueWeb09 dataset is a large-scale dataset for web search and information retrieval. -
GLOW : Global Weighted Self-Attention Network for Web Search
GLOW is a novel Global Weighted Self-Attention Network for web document search. It leverages global corpus statistics into the deep matching model. -
Synthetic Dataset
The dataset used in this work is a custom synthetic dataset generated using the liquid-dsp library, containing 600000 examples of each of 13.8 million examples, with SNRs... -
Keyword extraction from a single document using word co-occurrence statistica...
A dataset for keyword extraction from a single document using word co-occurrence statistical information. -
KPEVAL: Towards Fine-Grained Semantic-Based Keyphrase Evaluation
A comprehensive evaluation framework for keyphrase systems, including reference agreement, faithfulness, diversity, and utility.