Rethinking the Evaluation of Video Summaries M Otani, Y Nakashima, E Rahtu, J Heikkilä IEEE Computer Society Conference on Computer Vision and Pattern Recognition …, 2019 | 170 | 2019 |
Video summarization using deep semantic features M Otani, Y Nakashima, E Rahtu, J Heikkilä, N Yokoya Computer Vision–ACCV 2016: 13th Asian Conference on Computer Vision, Taipei …, 2017 | 161 | 2017 |
BERT representations for video question answering Z Yang, N Garcia, C Chu, M Otani, Y Nakashima, H Takemura Proceedings of the IEEE/CVF winter conference on applications of computer …, 2020 | 127 | 2020 |
Learning joint representations of videos and sentences with web image search M Otani, Y Nakashima, E Rahtu, J Heikkilä, N Yokoya European Conference on Computer Vision Workshop, 651-667, 2016 | 103 | 2016 |
Constrained graphic layout generation via latent optimization K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the 29th ACM International Conference on Multimedia, 88-96, 2021 | 100 | 2021 |
LayoutDM: Discrete diffusion model for controllable layout generation N Inoue, K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 99 | 2023 |
KnowIT VQA: Answering knowledge-based questions about videos N Garcia, M Otani, C Chu, Y Nakashima Proceedings of the AAAI conference on artificial intelligence 34 (07), 10826 …, 2020 | 97 | 2020 |
Uncovering Hidden Challenges in Query-Based Video Moment Retrieval M Otani, Y Nakashima, E Rahtu, J Heikkilä British Machine Vision Conference, 2020 | 77 | 2020 |
Toward verifiable and reproducible human evaluation for text-to-image generation M Otani, R Togashi, Y Sawai, R Ishigami, Y Nakashima, E Rahtu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 69 | 2023 |
A dataset and baselines for visual question answering on art N Garcia, C Ye, Z Liu, Q Hu, M Otani, C Chu, Y Nakashima, T Mitamura Computer Vision–ECCV 2020 Workshops: Glasgow, UK, August 23–28, 2020 …, 2020 | 64 | 2020 |
Alleviating cold-start problems in recommendation through pseudo-labelling over knowledge graph R Togashi, M Otani, S Satoh Proceedings of the 14th ACM international conference on web search and data …, 2021 | 45 | 2021 |
Does robustness on ImageNet transfer to downstream tasks? Y Yamada, M Otani Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 37 | 2022 |
Modeling visual containment for web page layout optimization K Kikuchi, M Otani, K Yamaguchi, E Simo-Serra Computer Graphics Forum 40 (7), 33-44, 2021 | 19 | 2021 |
A comparative study of language transformers for video question answering Z Yang, N Garcia, C Chu, M Otani, Y Nakashima, H Takemura Neurocomputing 445, 121-133, 2021 | 19 | 2021 |
Optimal correction cost for object detection evaluation M Otani, R Togashi, Y Nakashima, E Rahtu, J Heikkilä, S Satoh Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 18 | 2022 |
The laughing machine: Predicting humor in video Y Kayatani, Z Yang, M Otani, N Garcia, C Chu, Y Nakashima, H Takemura Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021 | 17 | 2021 |
Video summarization using textual descriptions for authoring video blogs M Otani, Y Nakashima, T Sato, N Yokoya Multimedia Tools and Applications 76, 12097-12115, 2017 | 17 | 2017 |
Towards flexible multi-modal document models N Inoue, K Kikuchi, E Simo-Serra, M Otani, K Yamaguchi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 13 | 2023 |
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image C Chu, M Otani, Y Nakashima International Conference on Computational Linguistics, 3479–3492, 2018 | 12 | 2018 |
Textual description-based video summarization for video blogs M Otani, Y Nakashima, T Sato, N Yokoya 2015 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2015 | 12 | 2015 |