Automata and Cognition
Cognitive Architectures
Цели урока
- Understand SOAR principles: problem space, impasse, chunking as online learning
- Master ACT-R math: activation, retrieval time, power law of forgetting
- Understand LIDA: conscious broadcast as an attention mechanism
- Compare architectures and recognize their ideas in modern LLM agents
Предварительные знания
- Global Workspace Theory (lesson 11)
- Basic understanding of IF-THEN production systems
- Intuitive familiarity with neural networks
SOAR flew a fighter jet in 1995. ACT-R predicts fMRI patterns. LIDA engineers GWT. All before LLMs. Modern agents reinvent these ideas from scratch.
- TacAir-Soar (1995): SOAR controlled a virtual F-16 in US Air Force simulators
- ACT-R (2004): model reproduced fMRI patterns during math tasks down to specific brain regions
- LIDA is used to model attention and disorders of consciousness in neuroscience
- ReAct, AutoGPT, LangGraph: implicitly reinvent SOAR (planning) and ACT-R (memory)
- Transformer attention is a partial implementation of GWT, but without explicit winner-take-all
Three Paths to General Intelligence
Allen Newell and John Laird created SOAR in 1983 as an attempt to realize Newell's 'unified theory of cognition'. John Anderson developed ACT-R in 1993, emphasizing mathematical precision and behavior prediction. Stan Franklin created LIDA in 2005, integrating GWT. All three worked in parallel - different views on the same problem.
SOAR: Learning Through Impasse
**1995. TacAir-Soar controls a virtual F-16 in a US Air Force simulator. The system flies, evades missiles, engages targets - and learns while doing it.** Not a neural network, not an LLM. A production system with 50,000 IF-THEN rules that creates new rules through chunking every time it hits an impasse. This is SOAR - State, Operator And Result.
**SOAR** is a problem-oriented architecture by Allen Newell and John Laird (1983). All cognition is movement through problem space: current state + operator = new state. When no operator can be selected, an impasse occurs, a subgoal is created, and after resolution - a new rule (chunk) is generated. Chunking = compilation of experience.
Three Memory Types and the Main Cycle
| Memory | Content | LLM Analog |
|---|---|---|
| Procedural (LTM) | Production rules: IF conditions THEN actions | Model weights |
| Semantic (LTM) | World knowledge | Pretrained knowledge |
| Episodic (LTM) | Event memories | Conversation history |
| Working Memory | Current state (focus) | Context window |
SOAR is outdated symbolic AI, incompatible with neural networks
SOAR ideas (chunking, impasse, memory types) directly influence modern LLM agent architectures
Working Memory = context window, Procedural = weights, Episodic = history. Chunking is analogous to in-context learning. The difference: SOAR models these components explicitly, LLMs do it implicitly in parameters.
What is the difference between the Elaboration and Decision phases in SOAR?
ACT-R: Mathematics of Human Memory
**ACT-R (Adaptive Control of Thought - Rational) does something no neural network can: predict human reaction time to the millisecond.** In 2004, an ACT-R model accurately reproduced fMRI patterns during mathematical problem solving - down to specific brain regions. John Anderson created not just an AI architecture, but a formal model of human cognition.
**ACT-R central bottleneck**: only ONE production rule can execute per cycle (~50 ms). This explains the psychological refractory period - why humans cannot fully parallelize two tasks. Peripheral modules (vision, hearing, hands) work in parallel, but the central processor is strictly sequential.
The Memory Activation Formula
The key idea in ACT-R: each chunk (unit of knowledge) has **activation A = B + S + e**, where B is base-level (usage frequency), S is spreading activation from context, e is noise. Retrieval time: **T = F * exp(-A)**. Higher activation means faster retrieval.
| Phenomenon | ACT-R Prediction | Verification |
|---|---|---|
| Spacing effect | Distributed repetitions increase B more | Confirmed by fMRI 2004 |
| Testing effect | Retrieval strengthens chunk more than re-study | Hundreds of experiments |
| Dual-task interference | One bottleneck = sequential processing | PRP experiments |
| Skill acquisition | 3 stages: cognitive to associative to autonomous | Fitts 1954 + ACT-R |
What does the base-level activation formula B = ln(sum t^{-d}) predict in ACT-R?
LIDA: Consciousness Through Coalition Competition
**LIDA (Learning Intelligent Distribution Agent) is the only cognitive architecture that explicitly implements Global Workspace Theory.** Stan Franklin created it in 2005 as an engineering realization of Baars' theory: consciousness is the winner of competition for the global workspace. Each cognitive cycle (~200-500 ms) is a competition among coalitions for the right to be broadcast to all system modules.
**Codelets** are small specialized processes inside LIDA. Attention codelets search for what is important, behavior codelets propose actions, structure building codelets construct understanding from sensory data. This implements Marvin Minsky's 'society of mind' idea - intelligence as interaction of simple agents.
LIDA implements consciousness, so it is smarter than SOAR and ACT-R
LIDA models one aspect of consciousness (broadcast) but does not solve a broader range of tasks
SOAR excels at strategic planning, ACT-R predicts human performance, LIDA models attention and consciousness. These are different tools for different tasks, not a hierarchy.
Why does LIDA record every conscious broadcast to episodic memory?
Comparison and Application to LLM Agents
**All three architectures solve the same problem in different ways: how to organize intelligence that not only reacts but plans, learns, and remembers.** Modern LLM agents reinvent the same components - often worse, because they lack 40 years of cognitive science behind them.
| Aspect | SOAR | ACT-R | LIDA |
|---|---|---|---|
| Focus | Problem solving | Human cognition | Consciousness |
| Learning | Chunking from impasse | Power law (math) | Through broadcast |
| Cycle time | Not fixed | ~50 ms (bottleneck) | ~200-500 ms |
| Parallelism | Elaboration (all rules) | Modules parallel | Codelets parallel |
| Strength | Strategic reasoning | Behavior prediction | Attention and consciousness |
How These Ideas Live in LLM Agents
| Architecture Component | SOAR/ACT-R/LIDA | LLM Agent |
|---|---|---|
| Working Memory | Explicit buffer (1 chunk) | Context window (thousands of tokens) |
| Long-Term Memory | Separate typed stores | Parameters + RAG + tools |
| Procedural Knowledge | Production rules | In-context patterns |
| Chunking/Learning | Online, during task | Offline training + few-shot |
| Goal Stack | Explicit, hierarchical | Implicit in prompt |
When designing an LLM agent, it pays to implement explicitly: Working Memory (what is in focus now), Goal Stack (goal hierarchy), Episodic Memory (what happened), Meta-Cognition (confidence monitoring). Systems that add these components explicitly are more stable than those where everything is implicit in parameters.
Which component of cognitive architectures differs most from its LLM agent counterpart?
Вопросы для размышления
- When designing an LLM agent for a real task - which components from SOAR/ACT-R/LIDA are worth implementing explicitly, and why cannot one rely on them being emergent from parameters?