Team eam RU RUC AI AI·M3 at at Vid Video eo Pe Pentathlon Cha Challeng nge 2020 2020
Shizhe Chen, Yida Zhao, Qin Jin
Renmin University of China
1
AIM 3 at Team eam RU RUC AI at Vid Video eo Pe Pentathlon Cha - - PowerPoint PPT Presentation
AIM 3 at Team eam RU RUC AI at Vid Video eo Pe Pentathlon Cha Challeng nge 2020 2020 Shizhe Chen , Yida Zhao, Qin Jin Renmin University of China 1 Vi Video Pe Pentathlon Ch Challenge Task Text-to-Video Cross-modal
Shizhe Chen, Yida Zhao, Qin Jin
Renmin University of China
1
2
3
4
to represent complicated video and text details
5
Global Local
Chen, Shizhe, et al. "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning." CVPR, 2020.
6
+ 1.25 + 0.77 + 4.18 + 2.98 + 1.84 Absolute Gains
Average Sentence Length 9 7 33 54 9
7
8
9
Smith, Samuel L., et al. “Offline bilingual word vectors, orthogonal transformations and the inverted softmax.” ICLR, 2017.
10
11
MSRVTT MSVD DiDeMo Anet YC2 # trn pairs 117,220 43,892 7,552 8,007 7,745
12
13
HGR model with multi- task balanced training Average Ensembling (3-5 models) Query Expansion (optional) Hubness mitigation inference
matching models for text-video retrieval
14
Contact email: cszhe1@ruc.edu.cn