Follow
Minsoo Kim
Minsoo Kim
Verified email at hanyang.ac.kr - Homepage
Title
Cited by
Cited by
Year
Nn-lut: neural approximation of non-linear operations for efficient transformer inference
J Yu, J Park, S Park, M Kim, S Lee, DH Lee, J Choi
Proceedings of the 59th ACM/IEEE Design Automation Conference, 577-582, 2022
402022
Token-scaled logit distillation for ternary weight generative language models
M Kim, S Lee, J Lee, S Hong, DS Chang, W Sung, J Choi
Advances in Neural Information Processing Systems 36, 2023
152023
Understanding and Improving Knowledge Distillation for Quantization-Aware Training of Large Transformer Encoders
M Kim, S Lee, S Hong, DS Chang, J Choi
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
82022
Enhancing computation efficiency in large language models through weight and activation quantization
J Lee, M Kim, S Baek, SJ Hwang, W Sung, J Choi
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
72023
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs
M Kim, K Shim, J Choi, S Chang
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
12024
RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models
M Kim, S Lee, W Sung, J Choi
Findings of the Association for Computational Linguistics ACL 2024, 15773-15786, 2024
12024
Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers
M Kim, K Shim, S Park, W Sung, J Choi
Proceedings of the 17th Conference of the European Chapter of the …, 2023
12023
RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy
G Lee, J Lee, S Hong, M Kim, E Ahn, DS Chang, J Choi
arXiv preprint arXiv:2412.01129, 2024
2024
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment
J Lee, S Park, S Hong, M Kim, DS Chang, J Choi
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–9