動作識別——Multi-Model Domain Adaptation for Fine-Grained Action Recognition——CVPR2020 oral

作者信息

Abstract

Fine-grained action recognition datasets exhibit environmental bias, where multiple video sequences are captured from a limited number of environments. Multi-modal nature of video(視頻的多模態性),提出的方法一個是multi-modal self-supervision,還有一個是adversarial training per modality

Introduction

fine-grained action recognition,
舉的例子
不同數據集的比較
Few works have attempted deep UDA for video data《Temporal attentive alignment for large-scale video domain adaptation, ICCV2019》《Deep domain adaptation in action space, BMVC2018》

Conclusion

modality指的是兩種信息(optical flow和RGB信息),future work包含audio

Key points: Motivation很好; 提出的新數據集

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章