-
MUGEN-GAME
MUGEN-GAME: A large-scale and multimodal dataset for video-audio-text multimodal understanding and generation -
M2Chat: Empowering VLM for Multimodal LLM Interleaved
M2Chat is a novel unified multimodal LLM framework for generating interleaved text-image conversation across various scenarios.