フォロー
Alessandro Stolfo
Alessandro Stolfo
確認したメール アドレス: ethz.ch - ホームページ
タイトル
引用先
引用先
Distilling Reasoning Capabilities into Smaller Language Models
K Shridhar*, A Stolfo*, M Sachan
ACL 2023 (Findings), 2023
168*2023
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
A Stolfo, Y Belinkov, M Sachan
EMNLP 2023, 2023
662023
A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models
A Stolfo*, Z Jin*, K Shridhar, B Schölkopf, M Sachan
ACL 2023, 2022
502022
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Y Hou, J Li, Y Fei, A Stolfo, W Zhou, G Zeng, A Bosselut, M Sachan
EMNLP 2023, 2023
172023
Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?
A Opedal*, A Stolfo*, H Shirakami, Y Jiao, R Cotterell, B Schölkopf, ...
ICML 2024, 2024
152024
A Simple Unsupervised Approach for Coreference Resolution using Rule-based Weak Supervision
A Stolfo, C Tanner, V Gupta, M Sachan
Proceedings of the 11th Joint Conference on Lexical and Computational …, 2022
72022
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
A Stolfo
NAACL 2024 (Findings), 2024
42024
Confidence Regulation Neurons in Language Models
A Stolfo*, B Wu*, W Gurnee, Y Belinkov, X Song, M Sachan, N Nanda
NeurIPS 2024, 2024
32024
Longtonotes: OntoNotes with Longer Coreference Chains
K Shridhar, N Monath, R Thirukovalluru, A Stolfo, M Zaheer, A McCallum, ...
EACL 2023 (Findings), 2022
32022
Improving Instruction-Following in Language Models through Activation Steering
A Stolfo, V Balachandran, S Yousefi, E Horvitz, B Nushi
arXiv preprint arXiv:2410.12877, 2024
2024
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–10