Publications & Activities

Education

Doctoral Student / 博士後期課程 [2025.04 - ]
Graduate Institute for Advanced Studies, SOKENDAI. / 総合研究大学院大学
Supervisor: Assoc. Prof. Sho Yokoi
Master of Information Science / 修士（情報科学） [2023.04 - 2025.03]
Graduate School of Information Sciences, Tohoku University. / 東北大学大学院情報科学研究科
Supervisor: Prof. Jun Suzuki & Assoc. Prof. Keisuke Sakaguchi
Dean Award / 研究科長賞 (4/126)
Bachelor of Engineering / 学士（工学） [2020.04 - 2023.03]
School of Engineering, Tohoku University. / 東北大学工学部
Supervisor: Prof. Kentaro Inui & Assoc. Prof. Keisuke Sakaguchi
Early Graduation / 早期卒業 (1/252)

International Conferences

Mutsumi Sasaki, Go Kamoda, Ryosuke Takahashi, Kosuke Sato, Benjamin Heinzerling, Keisuke Sakaguchi, & Kentaro Inui (2025).
Can Language Models Handle a Non-Gregorian Calendar? The Case of the Japanese wareki.
In Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL 2025).
ACL Anthology Google Scholar arXiv
Tatsuro Inaba, Go Kamoda, Kentaro Inui, Masaru Isonuma, Yusuke Miyao, Yohei Oseki, Yu Takagi, & Benjamin Heinzerling (2025).
How a Bilingual LM Becomes Bilingual: Tracing Internal Representations with Sparse Autoencoders.
In Findings of the Association for Computational Linguistics: EMNLP 2025.
ACL Anthology Google Scholar arXiv
Ryosuke Takahashi, Go Kamoda, Benjamin Heinzerling, Keisuke Sakaguchi, & Kentaro Inui (2025).
Understanding the Side Effects of Rank-One Knowledge Editing.
In BlackboxNLP 2025: The 8th Workshop on Analyzing and Interpreting Neural Networks for NLP.
ACL Anthology Google Scholar arXiv
Go Kamoda, Benjamin Heinzerling, Tatsuro Inaba, Keito Kudo, Keisuke Sakaguchi, & Kentaro Inui (2025).
Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference.
In Findings of the Association for Computational Linguistics: NAACL 2025.
ACL Anthology arXiv Google Scholar GitHub
Hiroki Deguchi, Go Kamoda, Yusuke Matsushita, Chihiro Taguchi, Masaki Waga, Kohei Suenaga, & Sho Yokoi (2025).
SoftMatcha: A Soft and Fast Pattern Matcher for Billion-Scale Corpus Searches.
In The Thirteenth International Conference on Learning Representations (ICLR 2025).
OpenReview arXiv Google Scholar Project Page
Go Kamoda, Akari Asai, Ana Brassard, & Keisuke Sakaguchi (2025).
Quantifying the Influence of Evaluation Aspects on Long-Form Response Assessment.
In Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025).
ACL Anthology Google Scholar GitHub
Go Kamoda, Benjamin Heinzerling, Keisuke Sakaguchi, & Kentaro Inui (2023).
Test-time Augmentation for Factual Probing.
In Findings of the Association for Computational Linguistics: EMNLP 2023.
ACL Anthology arXiv Google Scholar GitHub

Domestic Conferences

米田優峻, 鴨田豪, 松下祐介, 末永幸平, 秋葉拓哉, 和賀正樹, 横井祥 (2026).
SoftMatcha 2: 一兆語規模のコーパスに対する柔らかく超高速な検索システム.
言語処理学会第32回年次大会.
Project Page NLP 2026 プログラム
大橋諭貴, 木谷頼斗, 佐藤宏亮, 高橋良允, 鴨田豪, 山本悠士, 塩野大輝, 坂口慶祐, 小林悟郎 (2026).
注意機構における Attention Sink のバイアス項的解釈.
言語処理学会第32回年次大会.
NLP 2026 プログラム
木谷頼斗, 大橋諭貴, 佐藤宏亮, 鴨田豪, 高橋良允, 山本悠士, 塩野大輝, 坂口慶祐, 小林悟郎 (2026).
Attention Sink および Massive Activation の発生機序の分解.
言語処理学会第32回年次大会.
NLP 2026 プログラム
鴨田豪, 熊谷雄介, 松井孝太, 横井祥 (2025).
密度比の直接推定に基づく言語モデルの出力較正.
第20回言語処理若手シンポジウム (YANS).
佐藤宏亮, 鴨田豪, Benjamin Heinzerling, 坂口慶祐 (2025).
言語モデルの内部表現における文法情報の局所性について.
言語処理学会第31回年次大会, pp. 697-701.
予稿
佐々木睦史, 高橋良允, 鴨田豪, Benjamin Heinzerling, 坂口慶祐, 乾健太郎 (2025).
LM は日本の時系列構造をどうエンコードするか.
言語処理学会第31回年次大会, pp. 2642-2647.
日本経済新聞社 CDIO室賞
予稿
工藤慧音, 鴨田豪, 塩野大輝, 鈴木潤 (2025).
日本語バイト符号化マスク言語モデルの開発と分析.
言語処理学会第31回年次大会, pp. 3356-3361.
予稿 ByBERT-JP ByGPT-JP
小林春斗, 原知正, 鴨田豪, 横井祥 (2025).
層の冗長性と層同士の独立性に基づく言語モデルの層交換の成否の特徴づけ.
言語処理学会第31回年次大会, pp. 1751-1756.
若手奨励賞 (20/487)
予稿
鴨田豪, Benjamin Heinzerling, 稲葉達郎, 工藤慧音, 坂口慶祐, 乾健太郎 (2025).
言語モデルのパラメータから探るDetokenizationメカニズム.
言語処理学会第31回年次大会, pp. 634-639.
予稿
出口祥之, 鴨田豪, 松下祐介, 田口智大, 末永幸平, 和賀正樹, 横井祥 (2024).
SoftMatcha: 大規模コーパス検索のための柔らかくも高速なパターンマッチャー.
言語処理学会第31回年次大会, pp. 3310-3315.
予稿
小林春斗, 原知正, 鴨田豪, 横井祥 (2024).
層同士の接続可能性と各層が影響を与える部分空間の重なり度合いの関係性.
第19回言語処理若手シンポジウム (YANS).
若手奨励賞 (23/187)
*伊藤郁海, *鴨田豪, 熊谷雄介, 横井祥 (2024).
事前学習–文脈内学習パラダイムで生じる頻度バイアスの較正.
第19回言語処理若手シンポジウム (YANS).
出口祥之, 鴨田豪, 松下祐介, 慶田開, 和賀正樹, 横井祥 (2024).
柔らかいgrep/KWICに向けて：高速単語列マッチングの埋め込み表現による連続化.
第19回言語処理若手シンポジウム (YANS).
デモ賞 (1/15),
株式会社リクルート賞
Project Page
熊谷雄介, *伊藤郁海, *鴨田豪, 横井祥 (2024).
大規模言語モデルの情報推薦バイアスの較正.
2024年度人工知能学会全国大会 (第38回), pp. 3F1-GS-10-03.
予稿
良允高橋, 鴨田豪, Benjamin Heinzerling, 坂口慶祐, 乾健太郎 (2024).
言語モデルからの知識削除：頻出実体の知識は副作用が破滅的.
言語処理学会第30回年次大会, pp. 2864-2869.
若手奨励賞 (18/427)
予稿
鴨田豪, 浅井明里, Ana Brassard, 坂口慶祐 (2024).
長文生成の多面的評価:人手評価と自動評価の向上を目指して.
言語処理学会第30回年次大会, pp. 673-678.
優秀賞 (12/599)
予稿
Go Kamoda, Benjamin Heinzerling, Keisuke Sakaguchi, Kentaro Inui (2023).
Test-time Augmentation for Factual Probing.
言語処理学会第29回年次大会, pp. 1350-1355.
予稿

Preprints

Masataka Yoneda, Yusuke Matsushita, Go Kamoda, Kohei Suenaga, Takuya Akiba, Masaki Waga, & Sho Yokoi (2026).
SoftMatcha 2: A Fast and Soft Pattern Matcher for Trillion-Scale Corpora.
In arXiv [cs.CL].
Project Page arXiv

Experiences

SOKENDAI Special Researcher Program [2025.04 - Present]
Supported by JST BOOST.
SOKENDAI Special Researcher Program
NINJAL Part-time Researcher [2025.04 - Present]
NINJAL
Tohoku University GP-DS Research Assistant [2024.04 - 2025.03]
Competitive research fellowship
GP-DS
Visiting Student at NLP Department, MBZUAI [2024.02 - 2024.03]
MBZUAI
Hakuhodo DY Holdings Inc. Joint Research [2023.10 - Present]
AKATSUKI-SICA [2023.10 - 2024.02]
Social Impact Creators' Accelerator Program
Supported by Ministry of Economy, Trade and Industry of Japan
AKATSUKI-SICA Certificate (Open Badge)
NS Solutions R&D Internship [2023.09 - 2023.09]
NS Solutions
AI王 Committee Member [2023.05 - 2024.01]
AI王 YouTube News: 東洋経済 News: Tech+
Infratop (DMM WebCamp) [2021.03 - 2023.08]
Programming Mentor & School Management Member
DMM WebCamp