Armen Aghajanyan

Cited by

	All	Since 2019
Citations	2490	2481
h-index	19	19
i10-index	21	21

Citations per year
Year	Citations
2019	7
2020	37
2021	176
2022	490
2023	1177
2024	588

Co-authors

Luke Zettlemoyer	University of Washington; Meta	Verified email at cs.washington.edu
Mike Lewis	Facebook AI Research	Verified email at fb.com
Sonal Gupta	Researcher at Google	Verified email at google.com
Scott Wen-tau Yih	FAIR at Meta	Verified email at meta.com
Gargi Ghosh	Meta AI Research	Verified email at fb.com
Mandar Joshi	Google AI	Verified email at google.com
Naman Goyal	Facebook AI Research	Verified email at gatech.edu
Florian Metze	Carnegie Mellon University; Meta AI	Verified email at andrew.cmu.edu
Marjan Ghazvininejad	Research Scientist, FAIR (Facebook AI Research)	Verified email at fb.com

Armen Aghajanyan

Facebook AI Research

Verified email at fb.com

Deep Learning


Title	Authors	Venue	Cited by	Year
VideoCLIP: Contrastive pre-training for zero-shot video-text understanding	H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ...	arXiv preprint arXiv:2109.14084, 2021	402	2021
InCoder: A generative model for code infilling and synthesis	D Fried, A Aghajanyan, J Lin, S Wang, E Wallace, F Shi, R Zhong, W Yih, ...	arXiv preprint arXiv:2204.05999, 2022	373	2022
Intrinsic dimensionality explains the effectiveness of language model fine-tuning	A Aghajanyan, L Zettlemoyer, S Gupta	arXiv preprint arXiv:2012.13255, 2020	316	2020
Muppet: Massive multi-task representations with pre-finetuning	A Aghajanyan, A Gupta, A Shrivastava, X Chen, L Zettlemoyer, S Gupta	arXiv preprint arXiv:2101.11038, 2021	240	2021
Better fine-tuning by reducing representational collapse	A Aghajanyan, A Shrivastava, A Gupta, N Goyal, L Zettlemoyer, S Gupta	arXiv preprint arXiv:2008.03156, 2020	215	2020
Pre-training via paraphrasing	M Lewis, M Ghazvininejad, G Ghosh, A Aghajanyan, S Wang, ...	Advances in Neural Information Processing Systems 33, 18470-18481, 2020	145	2020
Memorization without overfitting: Analyzing the training dynamics of large language models	K Tirumala, A Markosyan, L Zettlemoyer, A Aghajanyan	Advances in Neural Information Processing Systems 35, 38274-38290, 2022	134	2022
CM3: A causal masked multimodal model of the internet	A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, ...	arXiv preprint arXiv:2201.07520, 2022	119	2022
Improving passage retrieval with zero-shot question generation	DS Sachan, M Lewis, M Joshi, A Aghajanyan, W Yih, J Pineau, ...	arXiv preprint arXiv:2204.07496, 2022	73	2022
HTLM: Hyper-text pre-training and prompting of language models	A Aghajanyan, D Okhonko, M Lewis, M Joshi, H Xu, G Ghosh, ...	arXiv preprint arXiv:2107.06955, 2021	62	2021
Scaling autoregressive multi-modal models: Pretraining and instruction tuning	L Yu, B Shi, R Pasunuru, B Muller, O Golovneva, T Wang, A Babu, B Tang, ...	arXiv preprint arXiv:2309.02591 2 (3), 2023	57*	2023
Retrieval-augmented multimodal language modeling	M Yasunaga, A Aghajanyan, W Shi, R James, J Leskovec, P Liang, ...	arXiv preprint arXiv:2211.12561, 2022	49	2022
Conversational semantic parsing	A Aghajanyan, J Maillard, A Shrivastava, K Diedrick, M Haeger, H Li, ...	arXiv preprint arXiv:2009.13655, 2020	49	2020
Megabyte: Predicting million-byte sequences with multiscale transformers	L Yu, D Simig, C Flaherty, A Aghajanyan, L Zettlemoyer, M Lewis	Advances in Neural Information Processing Systems 36, 2024	45	2024
Scaling laws for generative mixed-modal language models	A Aghajanyan, L Yu, A Conneau, WN Hsu, K Hambardzumyan, S Zhang, ...	International Conference on Machine Learning, 265-279, 2023	42	2023
Semantic representations using structural ontology for assistant systems	A Aghajanyan, S Gupta, B Moran, TF Levin, CANSH Nakatsu, D Difranco, ...	US Patent 11,688,022, 2023	31	2023
D4: Improving LLM pretraining via document de-duplication and diversification	K Tirumala, D Simig, A Aghajanyan, A Morcos	Advances in Neural Information Processing Systems 36, 2024	25	2024
Non-autoregressive semantic parsing for compositional task-oriented dialog	A Babu, A Shrivastava, A Aghajanyan, A Aly, A Fan, M Ghazvininejad	arXiv preprint arXiv:2104.04923, 2021	23	2021
Softtarget regularization: An effective technique to reduce over-fitting in neural networks	A Aghajanyan	2017 3rd IEEE International Conference on Cybernetics (CYBCONF), 1-5, 2017	20	2017
RetroNLU: Retrieval augmented task-oriented semantic parsing	V Gupta, A Shrivastava, A Sagar, A Aghajanyan, D Savenkov	arXiv preprint arXiv:2109.10410, 2021	19	2021
