ViLLa-X is a PyTorch implementation of a Vision-Language-Action model that enhances latent action...

Tokens:2,363
Snippets:17
Trust Score:9.9
License:MIT
Update:2 months ago
Tokens:
Raw