Hard-coded gaussian attention for neural machine translation W You, S Sun, M Iyyer ACL 2020, 2020 | 73 | 2020 |
RULER: What's the Real Context Size of Your Long-Context Language Models? CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, Y Zhang, ... arXiv preprint arXiv:2404.06654, 2024 | 70 | 2024 |
Do Long-Range Language Models Actually Use Long-Range Context? S Sun, K Krishna, A Mattarella-Micke, M Iyyer EMNLP 2021, 2021 | 70 | 2021 |
How to compare summarizers without target length? pitfalls, solutions and re-examination of the neural summarization literature S Sun, O Shapira, I Dagan, A Nenkova Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural …, 2019 | 55 | 2019 |
Energy-based reranking: Improving neural machine translation using energy-based models S Bhattacharyya, A Rooshenas, S Naskar, S Sun, M Iyyer, A McCallum ACL 2021, 2020 | 43 | 2020 |
Topicgpt: A prompt-based topic modeling framework CM Pham, A Hoyle, S Sun, P Resnik, M Iyyer arXiv preprint arXiv:2311.01449, 2023 | 41 | 2023 |
Pearl: Prompting large language models to plan and execute actions over long documents S Sun, Y Liu, S Wang, C Zhu, M Iyyer arXiv preprint arXiv:2305.14564, 2023 | 40 | 2023 |
The feasibility of embedding based automatic evaluation for single document summarization S Sun, A Nenkova Proceedings of the 2019 conference on empirical methods in natural language …, 2019 | 23 | 2019 |
Revisiting simple neural probabilistic language models S Sun, M Iyyer NAACL 2021, 2021 | 20 | 2021 |
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of rlhf S Sun, D Gupta, M Iyyer arXiv preprint arXiv:2309.09055, 2023 | 15 | 2023 |
IGA: An intent-guided authoring assistant S Sun, W Zhao, V Manjunatha, R Jain, V Morariu, F Dernoncourt, ... EMNLP 2021, 2021 | 15 | 2021 |
How does in-context learning help prompt tuning? S Sun, Y Liu, D Iter, C Zhu, M Iyyer arXiv preprint arXiv:2302.11521, 2023 | 14 | 2023 |
ChapterBreak: A Challenge Dataset for Long-Range Language Models S Sun, K Thai, M Iyyer NAACL 2022, 2022 | 14 | 2022 |
Alternative Input Signals Ease Transfer in Multilingual Machine Translation S Sun, A Fan, J Cross, V Chaudhary, C Tran, P Koehn, F Guzmán ACL 2022, 2022 | 12 | 2022 |
Energy-based reranking: Improving neural machine translation using energy-based models S Naskar, A Rooshenas, S Sun, M Iyyer, A McCallum arXiv e-prints, arXiv: 2009.13267, 2020 | 11 | 2020 |
Suri: Multi-constraint instruction following for long-form text generation CM Pham, S Sun, M Iyyer arXiv preprint arXiv:2406.19371, 2024 | 8 | 2024 |
Name disambiguation for chinese scientific authors with multi-level clustering S Sun, H Zhang, N Li, Y Chen 2017 IEEE International Conference on Computational Science and Engineering …, 2017 | 7 | 2017 |
Ruler: What’s the real context size of your long-context language models?, 2024 CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, Y Zhang, ... URL https://arxiv. org/abs/2404.06654, 0 | 7 | |
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages S Sun, M Elbayad, A Sun, J Cross EACL 2023, 2023 | 2 | 2023 |
ngpt: Normalized transformer with representation learning on the hypersphere I Loshchilov, CP Hsieh, S Sun, B Ginsburg arXiv preprint arXiv:2410.01131, 2024 | 1 | 2024 |