Follow
Simeng Sun
Simeng Sun
Verified email at nvidia.com - Homepage
Title
Cited by
Cited by
Year
Hard-coded gaussian attention for neural machine translation
W You, S Sun, M Iyyer
ACL 2020, 2020
732020
RULER: What's the Real Context Size of Your Long-Context Language Models?
CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, Y Zhang, ...
arXiv preprint arXiv:2404.06654, 2024
702024
Do Long-Range Language Models Actually Use Long-Range Context?
S Sun, K Krishna, A Mattarella-Micke, M Iyyer
EMNLP 2021, 2021
702021
How to compare summarizers without target length? pitfalls, solutions and re-examination of the neural summarization literature
S Sun, O Shapira, I Dagan, A Nenkova
Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural …, 2019
552019
Energy-based reranking: Improving neural machine translation using energy-based models
S Bhattacharyya, A Rooshenas, S Naskar, S Sun, M Iyyer, A McCallum
ACL 2021, 2020
432020
Topicgpt: A prompt-based topic modeling framework
CM Pham, A Hoyle, S Sun, P Resnik, M Iyyer
arXiv preprint arXiv:2311.01449, 2023
412023
Pearl: Prompting large language models to plan and execute actions over long documents
S Sun, Y Liu, S Wang, C Zhu, M Iyyer
arXiv preprint arXiv:2305.14564, 2023
402023
The feasibility of embedding based automatic evaluation for single document summarization
S Sun, A Nenkova
Proceedings of the 2019 conference on empirical methods in natural language …, 2019
232019
Revisiting simple neural probabilistic language models
S Sun, M Iyyer
NAACL 2021, 2021
202021
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of rlhf
S Sun, D Gupta, M Iyyer
arXiv preprint arXiv:2309.09055, 2023
152023
IGA: An intent-guided authoring assistant
S Sun, W Zhao, V Manjunatha, R Jain, V Morariu, F Dernoncourt, ...
EMNLP 2021, 2021
152021
How does in-context learning help prompt tuning?
S Sun, Y Liu, D Iter, C Zhu, M Iyyer
arXiv preprint arXiv:2302.11521, 2023
142023
ChapterBreak: A Challenge Dataset for Long-Range Language Models
S Sun, K Thai, M Iyyer
NAACL 2022, 2022
142022
Alternative Input Signals Ease Transfer in Multilingual Machine Translation
S Sun, A Fan, J Cross, V Chaudhary, C Tran, P Koehn, F Guzmán
ACL 2022, 2022
122022
Energy-based reranking: Improving neural machine translation using energy-based models
S Naskar, A Rooshenas, S Sun, M Iyyer, A McCallum
arXiv e-prints, arXiv: 2009.13267, 2020
112020
Suri: Multi-constraint instruction following for long-form text generation
CM Pham, S Sun, M Iyyer
arXiv preprint arXiv:2406.19371, 2024
82024
Name disambiguation for chinese scientific authors with multi-level clustering
S Sun, H Zhang, N Li, Y Chen
2017 IEEE International Conference on Computational Science and Engineering …, 2017
72017
Ruler: What’s the real context size of your long-context language models?, 2024
CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, Y Zhang, ...
URL https://arxiv. org/abs/2404.06654, 0
7
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages
S Sun, M Elbayad, A Sun, J Cross
EACL 2023, 2023
22023
ngpt: Normalized transformer with representation learning on the hypersphere
I Loshchilov, CP Hsieh, S Sun, B Ginsburg
arXiv preprint arXiv:2410.01131, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20