facebook/wav2vec2-large-north_germanic-voxpopuli-v2
Automatic Speech Recognition
TV2TV: A Unified Framework for Interleaved Language and Video Generation
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models