AV-HuBERT is a self-supervised representation learning framework for audio-visual speech that...

Tokens:4,371
Snippets:64
Trust Score:9.5
Update:2 weeks ago
Tokens:
Raw