ログイン
Language:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 原著論文

Spurious reconstruction from brain activity.

https://repo.qst.go.jp/records/2003239
https://repo.qst.go.jp/records/2003239
aa1c7f45-de32-4d55-94d7-53054d96c444
アイテムタイプ 学術雑誌論文 / Journal Article(1)
公開日 2026-04-14
タイトル
タイトル Spurious reconstruction from brain activity.
言語 en
言語
言語 eng
資源タイプ
資源タイプ識別子 http://purl.org/coar/resource_type/c_6501
資源タイプ journal article
著者 Ken Shirakawa

× Ken Shirakawa

Ken Shirakawa

Search repository
Yoshihiro Nagano

× Yoshihiro Nagano

Yoshihiro Nagano

Search repository
Misato Tanaka

× Misato Tanaka

Misato Tanaka

Search repository
Shuntaro C Aoki

× Shuntaro C Aoki

Shuntaro C Aoki

Search repository
Yusuke Muraki

× Yusuke Muraki

Yusuke Muraki

Search repository
Kei Majima

× Kei Majima

Kei Majima

Search repository
Yukiyasu Kamitani

× Yukiyasu Kamitani

Yukiyasu Kamitani

Search repository
抄録
内容記述タイプ Abstract
内容記述 Advances in brain decoding, particularly in visual image reconstruction, have sparked discussions about the societal implications and ethical considerations of neurotechnology. As reconstruction methods aim to recover visual experiences from brain activity and achieve prediction beyond training samples (zero-shot prediction), it is crucial to assess their capabilities and limitations to inform public expectations and regulations. Our case study of recent text-guided reconstruction methods, which leverage a large-scale dataset (Natural Scenes Dataset, NSD) and text-to-image diffusion models, reveals critical limitations in their generalizability, demonstrated by poor reconstructions on a different dataset. UMAP visualization of the text features from NSD images shows limited diversity with overlapping semantic and visual clusters between training and test sets. We identify that clustered training samples can lead to "output dimension collapse," restricting predictable output feature dimensions. While diverse training data improves generalization over the entire feature space without requiring exponential scaling, text features alone prove insufficient for mapping to the visual space. Our findings suggest that the apparent realism in current text-guided reconstructions stems from a combination of classification into trained categories and inauthentic image generation (hallucination) through diffusion models, rather than genuine visual reconstruction. We argue that careful selection of datasets and target features, coupled with rigorous evaluation methods, is essential for achieving authentic visual image reconstruction. These insights underscore the importance of grounding interdisciplinary discussions in a thorough understanding of the technology's current capabilities and limitations to ensure responsible development.
書誌情報 Neural networks : the official journal of the International Neural Network Society

巻 190, p. 107515, 発行日 2025-05
ISSN
収録物識別子タイプ ISSN
収録物識別子 1879-2782
PubMed番号
識別子タイプ PMID
関連識別子 40499302
DOI
識別子タイプ DOI
関連識別子 10.1016/j.neunet.2025.107515
戻る
0
views
See details
Views

Versions

Ver.1 2026-05-08 01:22:58.044979
Show All versions

Share

Share
tweet

Cite as

Other

print

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR 2.0
  • OAI-PMH JPCOAR 1.0
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX
  • ZIP

コミュニティ

確認

確認

確認


Powered by WEKO3


Powered by WEKO3