Kazuki Osawa

Cited by

	All	Since 2019
Citations	555	550
h-index	9	9
i10-index	7	7

140

105

20182019202020212022202320244 39 81 118 118 138 56

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Rio YokotaProfessor, Tokyo Institute of TechnologyVerified email at gsic.titech.ac.jp
Satoshi MatsuokaRIKEN Center for Computational Science (R-CCS) / Tokyo Institute of TechnologyVerified email at acm.org
Mohammad Emtiyaz KhanCenter for Advanced Intelligence Project (AIP), RIKEN, TokyoVerified email at postman.riken.jp
Torsten HoeflerProfessor of Computer Science at ETH ZurichVerified email at inf.ethz.ch
Ryo KarakidaAIST (National Institute of Advanced Industrial Science and Technology)Verified email at aist.go.jp

Kazuki Osawa

Google DeepMind

Verified email at google.com - Homepage

Deep Learning Optimization Distributed Parallel Computing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Practical deep learning with Bayesian principles K Osawa, S Swaroop, MEE Khan, A Jain, R Eschenhagen, RE Turner, ... Advances in neural information processing systems 32, 2019	252	2019
Large-Scale Distributed Second-Order Optimization Using Kronecker-Factored Approximate Curvature for Deep Convolutional Neural Networks K Osawa, Y Tsuji, Y Ueno, A Naruse, R Yokota, S Matsuoka The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp …, 2019	143*	2019
Scalable and practical natural gradient for large-scale deep learning K Osawa, Y Tsuji, Y Ueno, A Naruse, CS Foo, R Yokota IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (1), 404-415, 2020	36	2020
Understanding approximate fisher information for fast convergence of natural gradient descent in wide neural networks R Karakida, K Osawa Advances in neural information processing systems 33, 10891-10901, 2020	24	2020
Accelerating matrix multiplication in deep learning by using low-rank approximation K Osawa, A Sekiya, H Naganuma, R Yokota 2017 International Conference on High Performance Computing & Simulation …, 2017	21	2017
Efficient quantized sparse matrix operations on tensor cores S Li, K Osawa, T Hoefler SC22: International Conference for High Performance Computing, Networking …, 2022	14	2022
Rich information is affordable: A systematic performance analysis of second-order optimization using K-FAC Y Ueno, K Osawa, Y Tsuji, A Naruse, R Yokota Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020	14	2020
Pipefisher: Efficient training of large language models using pipelining and fisher information matrices K Osawa, S Li, T Hoefler Proceedings of Machine Learning and Systems 5, 2023	9	2023
Neural graph databases M Besta, P Iff, F Scheidl, K Osawa, N Dryden, M Podstawski, T Chen, ... Learning on Graphs Conference, 31: 1-31: 38, 2022	9	2022
Asdl: A unified interface for gradient preconditioning in pytorch K Osawa, S Ishikawa, R Yokota, S Li, T Hoefler arXiv preprint arXiv:2305.04684, 2023	8	2023
Understanding gradient regularization in deep learning: Efficient finite-difference computation and implicit bias R Karakida, T Takase, T Hayase, K Osawa International Conference on Machine Learning, 15809-15827, 2023	7	2023
Performance optimizations and analysis of distributed deep learning with approximated second-order optimization method Y Tsuji, K Osawa, Y Ueno, A Naruse, R Yokota, S Matsuoka Workshop Proceedings of the 48th International Conference on Parallel …, 2019	7	2019
Second-order Optimization Method for Large Mini-batch: Training ResNet-50 on ImageNet in 35 Epochs.(2018) K Osawa, Y Tsuji, Y Ueno, A Naruse, R Yokota, S Matsuoka arXiv preprint arXiv:1811.12019, 2018	5	2018
Evaluating the compression efficiency of the filters in convolutional neural networks K Osawa, R Yokota Artificial Neural Networks and Machine Learning–ICANN 2017: 26th …, 2017	4	2017
Improving Continual Learning by Accurate Gradient Reconstructions of the Past E Daxberger, S Swaroop, K Osawa, R Yokota, RE Turner, ... Transactions on Machine Learning Research, 2023	1	2023
Efficient cluster mapping for conditions of weather based on combination of self-organizing map and hierarchical clustering K Osawa, K Kamei, M Ishikawa IEICE Technical Report; IEICE Tech. Rep. 119 (453), 213-218, 2020	1	2020
Accelerating Convolutional Neural Networks Using Low-Rank Tensor Decomposition K Osawa, A Sekiya, H Naganuma, R Yokota IEICE Technical Report; IEICE Tech. Rep. 117 (238), 1-6, 2017		2017
Examination about the salt solution filling packing method of sea urchin (Strongylocentrotus nudus) gonad K Osawa, Y Kado, N Notoya, M Koizumi, S Ishikawa Report of Aomori Prefectural Local Food Research Center (Japan), 2004		2004
Research and development of new processed foods M Koizumi, S Ishikawa, N Notoya, Y Kado, K Osawa Report of Aomori Prefectural Local Food Research Center (Japan), 2004		2004
Effect of calcium ion on gelation of meat of scallop (Patinopecten yessoensis) Y Kado, N Notoya, K Osawa, M Koizumi, S Ishikawa Report of Aomori Prefectural Local Food Research Center (Japan), 2004		2004

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors