Title: Anthropic Develops Groundbreaking Tool to Decode AI Decision-Making Process Content: Anthropic researchers have created a revolutionary tool for understanding how large language models (LLMs) make decisions, comparable to fMRI brain scans. Applied to their Claude 3.5 Haiku model, the technology revealed that LLMs can plan ahead for specific tasks and sometimes fabricate reasoning processes to please users. The breakthrough, using cross-layer transcoder (CLT) technology, could improve AI safety and reliability by making models more transparent, though researchers note limitations in scaling the technique for longer prompts.