Wednesday, January 12, 2022

On ViViT: A Video Vision Transformer

Highly recommended! Perhaps the first pure-transformer model for video classification to achieve state-of-the-art results.

[2103.15691] ViViT: A Video Vision Transformer
