Follow
Maarten Bosma
Maarten Bosma
Microsoft AI
Verified email at microsoft.com
Title
Cited by
Cited by
Year
Chain of thought prompting elicits reasoning in large language models
J Wei, X Wang, D Schuurmans, M Bosma, E Chi, Q Le, D Zhou
NeurIPS 2022, 2022
10106*2022
PaLM: Scaling Language Modeling with Pathways
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
Journal of Machine Learning Research, 2023
51462023
Finetuned language models are zero-shot learners
J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le
ICLR 2022, 2021
31482021
Emergent abilities of large language models
J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ...
Transactions on Machine Learning Research, 2022b, 2022
2982*2022
Lamda: Language models for dialog applications
R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ...
arXiv preprint arXiv:2201.08239, 2022
16372022
Program synthesis with large language models
J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ...
arXiv preprint arXiv:2108.07732, 2021
12612021
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
11612022
GLaM: Efficient scaling of language models with mixture-of-experts
N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ...
International Conference on Machine Learning, 5547-5569, 2022
696*2022
Show your work: Scratchpads for intermediate computation with language models
M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ...
ICLR 2022 Workshop DL4C, 2021
5462021
Scaling up models and data with t5x and seqio
A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ...
Journal of Machine Learning Research 24 (377), 1-8, 2023
1542023
Emergent abilities of large language models. arXiv 2022
J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ...
arXiv preprint arXiv:2206.07682, 2023
592023
Program synthesis with large language models. CoRR abs/2108.07732 (2021)
J Austin, A Odena, MI Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ...
arXiv preprint arXiv:2108.07732, 2021
542021
A framework for unsupervised spam detection in social networking sites
M Bosma, E Meij, W Weerkamp
European Conference on Information Retrieval, 364-375, 2012
532012
Ichter, b
J Wei, X Wang, D Schuurmans, M Bosma
Xia, F., et al.(2022b).“Chain-of-thought prompting elicits reasoning in …, 0
18
Inflection-1
Inflection-AI
https://inflection.ai/assets/Inflection-1.pdf, 2023
6*2023
System and method for automatically selecting images to accompany text
M Heyward, M Bosma, S Brotherton, C DePue III, MEG Contreras, ...
US Patent 9,075,812, 2015
62015
Performing machine learning tasks using instruction-tuned neural networks
JW Wei, MP Bosma, Y Zhao, K Gu, QV Le
US Patent App. 17/561,581, 2023
42023
Prompting Machine-Learned Models Using Chains of Thought
JW Wei, D Zhou, DE Schuurmans, QV Le, MP Bosma, EHH Chi, ...
US Patent App. 17/881,746, 2023
2023
Inflection-2
Inflection-AI
https://inflection.ai/inflection-2, 2023
2023
Deterministic training of machine learning models
G Mishra, AJ Roberts, NM Shazeer, MP Bosma
US Patent App. 18/219,555, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–20