Title | Authors | Venue | Cited by | Year |
Exploring visual relationship for image captioning | T Yao, Y Pan, Y Li, T Mei | Proceedings of the European conference on computer vision (ECCV), 684-699, 2018 | 964 | 2018 |
Boosting image captioning with attributes | T Yao, Y Pan, Y Li, Z Qiu, T Mei | Proceedings of the IEEE international conference on computer vision, 4894-4902, 2017 | 799 | 2017 |
X-linear attention networks for image captioning | Y Pan, T Yao, Y Li, T Mei | Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 573 | 2020 |
Transferrable prototypical networks for unsupervised domain adaptation | Y Pan, T Yao, Y Li, Y Wang, CW Ngo, T Mei | Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 385 | 2019 |
Contextual transformer networks for visual recognition | Y Li, T Yao, Y Pan, T Mei | IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (2), 1489-1500, 2022 | 356 | 2022 |
Hierarchy parsing for image captioning | T Yao, Y Pan, Y Li, T Mei | Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 217 | 2019 |
Jointly localizing and describing events for dense video captioning | Y Li, T Yao, Y Pan, H Chao, T Mei | Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 201 | 2018 |
Incorporating copying mechanism in image captioning for learning novel objects | T Yao, Y Pan, Y Li, T Mei | Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 169 | 2017 |
Temporal deformable convolutional encoder-decoder networks for video captioning | J Chen, Y Pan, Y Li, T Yao, H Chao, T Mei | Proceedings of the AAAI conference on artificial intelligence 33 (01), 8167-8174, 2019 | 111 | 2019 |
Wave-vit: Unifying wavelet and transformers for visual representation learning | T Yao, Y Pan, Y Li, CW Ngo, T Mei | European Conference on Computer Vision, 328-345, 2022 | 94 | 2022 |
Exploring category-agnostic clusters for open-set domain adaptation | Y Pan, T Yao, Y Li, CW Ngo, T Mei | Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 80 | 2020 |
Pointing novel objects in image captioning | Y Li, T Yao, Y Pan, H Chao, T Mei | Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 78 | 2019 |
Comprehending and ordering semantics for image captioning | Y Li, Y Pan, T Yao, T Mei | Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 75 | 2022 |
Auto-captions on GIF: A large-scale video-sentence dataset for vision-language pre-training | Y Pan, Y Li, J Luo, J Xu, T Yao, T Mei | Proceedings of the 30th ACM International Conference on Multimedia, 7070-7074, 2022 | 63 | 2022 |
Dual vision transformer | T Yao, Y Li, Y Pan, Y Wang, XP Zhang, T Mei | IEEE transactions on pattern analysis and machine intelligence, 2023 | 59 | 2023 |
Scheduled sampling in vision-language pretraining with decoupled encoder-decoder network | Y Li, Y Pan, T Yao, J Chen, T Mei | Proceedings of the AAAI Conference on Artificial Intelligence 35 (10), 8518-8526, 2021 | 58 | 2021 |
Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure | Y Pan, Y Li, T Yao, T Mei, H Li, Y Rui | IJCAI, 3832-3838, 2016 | 56 | 2016 |
Unpaired image captioning with semantic-constrained self-learning | H Ben, Y Pan, Y Li, T Yao, R Hong, M Wang, T Mei | IEEE Transactions on Multimedia 24, 904-916, 2021 | 43 | 2021 |
CoCo-BERT: Improving video-language pre-training with contrastive cross-modal matching and denoising | J Luo, Y Li, Y Pan, T Yao, H Chao, T Mei | Proceedings of the 29th ACM International Conference on Multimedia, 5600-5608, 2021 | 38 | 2021 |
Semantic-conditional diffusion networks for image captioning | J Luo, Y Li, Y Pan, T Yao, J Feng, H Chao, T Mei | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 37 | 2023 |