Exploring visual relationship for image captioning T Yao, Y Pan, Y Li, T Mei Proceedings of the European conference on computer vision (ECCV), 684-699, 2018 | 1034 | 2018 |
Boosting image captioning with attributes T Yao, Y Pan, Y Li, Z Qiu, T Mei Proceedings of the IEEE international conference on computer vision, 4894-4902, 2017 | 838 | 2017 |
X-linear attention networks for image captioning Y Pan, T Yao, Y Li, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 646 | 2020 |
Contextual transformer networks for visual recognition Y Li, T Yao, Y Pan, T Mei IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (2), 1489-1500, 2022 | 506 | 2022 |
Transferrable prototypical networks for unsupervised domain adaptation Y Pan, T Yao, Y Li, Y Wang, CW Ngo, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 425 | 2019 |
Hierarchy parsing for image captioning T Yao, Y Pan, Y Li, T Mei Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 227 | 2019 |
Jointly localizing and describing events for dense video captioning Y Li, T Yao, Y Pan, H Chao, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 210 | 2018 |
Incorporating copying mechanism in image captioning for learning novel objects T Yao, Y Pan, Y Li, T Mei Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 177 | 2017 |
Wave-ViT: Unifying wavelet and transformers for visual representation learning T Yao, Y Pan, Y Li, CW Ngo, T Mei European Conference on Computer Vision, 328-345, 2022 | 135 | 2022 |
Temporal deformable convolutional encoder-decoder networks for video captioning J Chen, Y Pan, Y Li, T Yao, H Chao, T Mei Proceedings of the AAAI conference on artificial intelligence 33 (01), 8167-8174, 2019 | 119 | 2019 |
Comprehending and ordering semantics for image captioning Y Li, Y Pan, T Yao, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 104 | 2022 |
Exploring category-agnostic clusters for open-set domain adaptation Y Pan, T Yao, Y Li, CW Ngo, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 87 | 2020 |
Pointing novel objects in image captioning Y Li, T Yao, Y Pan, H Chao, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 86 | 2019 |
Dual vision transformer T Yao, Y Li, Y Pan, Y Wang, XP Zhang, T Mei IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (9), 10870 …, 2023 | 85 | 2023 |
Semantic-conditional diffusion networks for image captioning J Luo, Y Li, Y Pan, T Yao, J Feng, H Chao, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 65 | 2023 |
Auto-captions on GIF: A large-scale video-sentence dataset for vision-language pre-training Y Pan, Y Li, J Luo, J Xu, T Yao, T Mei Proceedings of the 30th ACM International Conference on Multimedia, 7070-7074, 2022 | 65 | 2022 |
Scheduled sampling in vision-language pretraining with decoupled encoder-decoder network Y Li, Y Pan, T Yao, J Chen, T Mei Proceedings of the AAAI Conference on Artificial Intelligence 35 (10), 8518-8526, 2021 | 62 | 2021 |
Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure. Y Pan, Y Li, T Yao, T Mei, H Li, Y Rui IJCAI, 3832-3838, 2016 | 56 | 2016 |
Unpaired image captioning with semantic-constrained self-learning H Ben, Y Pan, Y Li, T Yao, R Hong, M Wang, T Mei IEEE Transactions on Multimedia 24, 904-916, 2021 | 50 | 2021 |
CoCo-BERT: Improving video-language pre-training with contrastive cross-modal matching and denoising J Luo, Y Li, Y Pan, T Yao, H Chao, T Mei Proceedings of the 29th ACM International Conference on Multimedia, 5600-5608, 2021 | 46 | 2021 |