-
DiffusionTalker: Personalization and Acceleration for Speech-Driven 3D Face
Speech-driven 3D facial animation has been an attractive task in both academia and industry. Traditional methods mostly focus on learning a deterministic mapping from speech to... -
KITTI-Object
Monocular 3D object localization in driving scenes is a crucial task, but challenging due to its ill-posed nature. Estimating 3D coordinates for each pixel on the object surface... -
Nico Challenge
The Nico challenge is a benchmark for out-of-distribution generalization for image recognition challenges. -
ImageNet 2012 Large-Scale Visual Recognition Challenge
The dataset used in the paper is the ImageNet 2012 Large-Scale Visual Recognition Challenge dataset. -
HeadSculpt Testing Dataset
The dataset used in the paper for testing the HeadSculpt model, which consists of 3D head avatars generated from textual prompts. -
HeadSculpt Dataset
The dataset used in the paper for training and testing the HeadSculpt model, which consists of 3D head avatars generated from textual prompts. -
PhysioNet 2012
The dataset used in this paper for healthcare data democratization and information leakage prevention. -
IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers
Continuous-time models such as Neural ODEs and Neural Flows have shown promising results in analyzing irregu- -
SODA-D and SODA-A
The SODA-D and SODA-A datasets are large-scale benchmarks tailored for small object detection. -
ASSIST Dataset
The ASSIST dataset is an open dataset collected by the ASSISTments online tutoring systems. -
Math Dataset
The Math dataset is collected from the widely-used online learning system Zhixue1, which contains mathematical exercises and logs of high school examinations. -
Traffic4cast Challenge Solution
The traffic4cast dataset contains real-world traffic data for 3 cities (Berlin, Istanbul, and Moscow) with 5-minute aggregated information. -
Learning Fine-grained Fact-Article Correspondence in Legal Cases
This paper proposes a method for learning fine-grained fact-article correspondence in legal cases. -
BERT-XS: A Pre-trained Language Model for Chinese Long Documents
This paper proposes BERT-XS, a pre-trained language model for Chinese long documents. -
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents
This paper proposes Lawformer, a pre-trained language model for Chinese legal long documents. -
CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding
Legal case retrieval is a critical process for modern legal information systems. This paper proposes CaseEncoder, a pre-trained encoder that utilizes fine-grained legal... -
A deep learning approach for person identification using ear biometrics
Ear recognition using CNNs.