-
Exponential Family Mixture Model
The dataset used in this paper is a finite mixture model with a finite number of k components from a parametric statistical model P. -
Finite Mixture Model
The dataset used in this paper is a finite mixture model with a finite number of k components from a parametric statistical model P. -
Detonation Flowfield
The dataset used in this analysis comes from high-fidelity hydrogen-air detonation simulations in a 2-dimensional channel configuration. -
Synthetic Control Chart Time Series
The dataset used in the paper is a synthetic control chart time series dataset, which contains control charts synthetically generated by Alcock and Manolopoulos for the six... -
Clustering with Noisy Oracle
Clustering with noisy oracle: We initiate a rigorous theoretical study of clustering with noisy queries (or a faulty oracle). -
Dataset for clustering with missing data
The dataset used in this paper is a collection of 9600 incomplete data sets, each with 10 variables and 25-200 individuals, with varying levels of missing values. -
Group-sparse autoencoders for clustering
The MNIST dataset is used to demonstrate the clustering capability of the proposed group-sparse autoencoder. -
Sampling Matters in Deep Embedding Learning
Deep embeddings answer one simple question: How similar are two images? Learning these embeddings is the bedrock of verification, zero-shot learning, and visual search. -
Shuttle Dataset
The shuttle dataset is used for anomaly clustering and visualization. It contains 9 numerical attributes and 7 classes. -
Real dataset
The dataset used in the paper is a real dataset containing 122 time series representing the evolution, during a mission, of the altitude of a helicopter. -
Synthetic Dataset for T-Phenotype Evaluation
A synthetic dataset for evaluating the performance of T-Phenotype. -
USPS dataset
The USPS dataset consists of 9298 images of handwritten digits 0-9 (10 classes) of 16x16 pixels in gray scale. -
Real-world dataset
The dataset used in this paper for testing the 3D-RecGAN++ model. It contains 1.5k SV and 2.5k CV testing datasets for each of the 6 categories. -
Unsupervised Clustered Federated Learning in Complex Multi-source Acoustic En...
The dataset is a complex multi-source acoustic environment and an improved algorithm for the estimation of source-dominated microphone clusters in acoustic sensor networks. -
Bank Marketing
The dataset comprises of phone call records of a marketing campaign run by a Portuguese bank. The marital status of the clients is considered feature to ensure fairness.