Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video
Data and Resources
-
Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Cite this as
Minsu Kim, Chae Won Kim, Yong Man Ro (2024). Dataset: Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video. https://doi.org/10.57702/pyhelu8l
DOI retrieved: December 16, 2024
Additional Info
Field | Value |
---|---|
Created | December 16, 2024 |
Last update | December 16, 2024 |
Defined In | https://doi.org/10.48550/arXiv.2303.08670 |
Author | Minsu Kim |
More Authors |
|