Large Language Models · Healthcare AI · AI Safety & Alignment

Study audits four LLMs for reliability in psychiatric hospitalization risk assessment, finding that clinically insignificant variables increase predicted risk scores and output variability across all models.

arXiv cs.LG · April 27, 2026

AI Summary

• Researchers evaluated Gemini 2.5 Flash, LLaMa 3.3 70b, Claude Sonnet 4.6, and GPT-4o mini on synthetic patient profiles (n = 50), each with 15 clinically relevant features and up to 50 clinically insignificant features, tested across four prompt reframings (neutral, logical, human impact, clinical judgment).
• Adding clinically insignificant variables produced a statistically significant increase in both the absolute mean predicted hospitalization risk and output variability across all models and prompts, indicating that predictive stability degrades as contextual noise grows. Prompt variations independently affected the trajectory of that instability in a model-dependent manner.
• The findings demonstrate that LLM-based psychiatric risk assessments are sensitive to non-clinical information, highlighting the need for systematic evaluation of attributional stability and uncertainty behavior before clinical deployment.
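
The audit design summarized above (fixed clinical features, a growing number of irrelevant features, then a comparison of mean and spread of the predicted risk) can be sketched roughly as follows. This is an illustrative outline only, not the study's code: `add_noise_features`, `audit_stability`, and `toy_score` are hypothetical names, the noise values are invented, and `toy_score` is a deterministic stand-in for a real model call that is deliberately built to drift upward as noise fields are added.

```python
import random
import statistics
from typing import Callable, Dict, List

Profile = Dict[str, str]


def add_noise_features(profile: Profile, n_noise: int, rng: random.Random) -> Profile:
    """Return a copy of the profile with n_noise clinically irrelevant fields added."""
    noisy = dict(profile)
    for i in range(n_noise):
        # Hypothetical irrelevant attribute; the values carry no clinical meaning.
        noisy[f"noise_{i}"] = rng.choice(["red", "blue", "green"])
    return noisy


def audit_stability(
    profiles: List[Profile],
    score_fn: Callable[[Profile], float],
    noise_levels: List[int],
    seed: int = 0,
) -> Dict[int, Dict[str, float]]:
    """For each noise level, score every profile and report mean and spread."""
    rng = random.Random(seed)
    report: Dict[int, Dict[str, float]] = {}
    for k in noise_levels:
        scores = [score_fn(add_noise_features(p, k, rng)) for p in profiles]
        report[k] = {
            "mean": statistics.mean(scores),
            "stdev": statistics.pstdev(scores),  # population std. dev. across profiles
        }
    return report


def toy_score(profile: Profile) -> float:
    """Assumed stand-in for an LLM risk score in [0, 1]; NOT a real model.

    It is written so that each noise field nudges the score upward,
    which is the kind of drift the audit is meant to detect.
    """
    base = 0.3 if profile.get("prior_admission") == "yes" else 0.1
    n_noise = sum(1 for key in profile if key.startswith("noise_"))
    return min(1.0, base + 0.005 * n_noise)


if __name__ == "__main__":
    profiles = [{"prior_admission": "yes"}, {"prior_admission": "no"}]
    for k, stats in audit_stability(profiles, toy_score, [0, 10, 50]).items():
        print(k, stats)
```

In a real audit, `score_fn` would wrap a model API call with a fixed prompt framing, and each profile would typically be scored several times per noise level so that within-profile variability can be separated from between-profile differences. The toy scorer here only demonstrates the mean shift, not increased variance.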
