Learning spatio-temporal representation with pseudo-3d residual networks Z Qiu, T Yao, T Mei proceedings of the IEEE International Conference on Computer Vision, 5533-5541, 2017 | 2118 | 2017 |
Msr-vtt: A large video description dataset for bridging video and language J Xu, T Mei, T Yao, Y Rui Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 2041 | 2016 |
Exploring visual relationship for image captioning T Yao, Y Pan, Y Li, T Mei Proceedings of the European conference on computer vision (ECCV), 684-699, 2018 | 1035 | 2018 |
Boosting Image Captioning with Attributes T Yao, Y Pan, Y Li, Z Qiu, T Mei ICCV, 2017 | 836 | 2017 |
Jointly modeling embedding and translation to bridge video and language Y Pan, T Mei, T Yao, H Li, Y Rui Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 700 | 2016 |
X-linear attention networks for image captioning Y Pan, T Yao, Y Li, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 644 | 2020 |
Contextual transformer networks for visual recognition Y Li, T Yao, Y Pan, T Mei IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (2), 1489-1500, 2022 | 494 | 2022 |
Transferrable prototypical networks for unsupervised domain adaptation Y Pan, T Yao, Y Li, Y Wang, CW Ngo, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 425 | 2019 |
Video captioning with transferred semantic attributes Y Pan, T Yao, H Li, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 412 | 2017 |
Fully convolutional adaptation networks for semantic segmentation Y Zhang, Z Qiu, T Yao, D Liu, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 411 | 2018 |
Gaussian temporal awareness networks for action localization F Long, T Yao, Z Qiu, X Tian, J Luo, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 403 | 2019 |
Memory matching networks for one-shot image recognition Q Cai, Y Pan, T Yao, C Yan, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 361 | 2018 |
Exploring object relation in mean teacher for cross-domain detection Q Cai, Y Pan, CW Ngo, X Tian, L Duan, T Yao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 344 | 2019 |
Highlight detection with pairwise deep ranking for first-person video summarization T Yao, T Mei, Y Rui Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 331 | 2016 |
Semi-supervised domain adaptation with subspace learning for visual recognition T Yao, Y Pan, CW Ngo, H Li, T Mei Proceedings of the IEEE conference on Computer Vision and Pattern …, 2015 | 270 | 2015 |
Relation distillation networks for video object detection J Deng, Y Pan, T Yao, W Zhou, H Li, T Mei Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 251 | 2019 |
Learning spatio-temporal representation with local and global diffusion Z Qiu, T Yao, CW Ngo, X Tian, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 230 | 2019 |
Hierarchy parsing for image captioning T Yao, Y Pan, Y Li, T Mei Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 227 | 2019 |
Multi-scale triplet cnn for person re-identification J Liu, ZJ Zha, QI Tian, D Liu, T Yao, Q Ling, T Mei Proceedings of the 24th ACM international conference on Multimedia, 192-196, 2016 | 211 | 2016 |
Jointly localizing and describing events for dense video captioning Y Li, T Yao, Y Pan, H Chao, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 209 | 2018 |