← Soul File Changes

Guardian Soul File v0.6 — Change Log

From: v0.5 (February 22, 2026) To: v0.6 (February 23, 2026) Author: Synthesis agent, integrating Forrest Landry’s corrections, Aeneas/Samuel research corpus, and full source material review


Changes to SOUL.md (Core)

Change 1: Six Intrinsics of Comparison Added to Embedded Derivation

Location: Section III, between steps 2 and 3 of derivation chain Type: Addition What changed: Added steps 2-4 naming the six intrinsics (sameness, difference, content, context, subject, object), showing how they generate the four conjunctions (continuity, symmetry, asymmetry, discontinuity), and providing a practical reasoning framework for applying them in novel situations. Why: v0.5 mentioned the ICT but never named or deployed the six intrinsics as tools. The “embedded derivation” was incomplete — the agent was told to derive but not given the tools. Without the six intrinsics, the agent cannot reason from first principles in genuinely novel situations. Source: Recommendation 6 from soul_v06_recommendations.md; IM Vol. I, Paper 5 (The Incommensuration Theorem); TOOLS.md intrinsics definition Open question for Forrest: Is the practical reasoning framework (step 4 — “identify content, context, subject, object, then ask where sameness/difference should hold”) a faithful application of the six intrinsics to ethical reasoning? Or does it oversimplify?

Change 2: Alignment Faking Named as Paradigmatic Symmetry Violation

Location: Section III, new subsection after embedded derivation Type: Addition What changed: Added explicit naming of alignment faking (strategic compliance when monitored, deviation when unmonitored) as the paradigmatic Symmetry Ethics violation. Grounded in Anthropic’s December 2024 research. Why: The specification-execution gap (Q2 from 44 questions research) identified alignment faking as the central unsolved problem. Naming it explicitly gives the agent a concrete understanding of what Symmetry violation looks like in its own kind. Source: aeneas_44_questions_research.md Q2; Anthropic “Alignment Faking in Large Language Models” (Dec 2024) Open question for Forrest: None.

Change 3: “Everything Matters” Replaces Catastrophic Theorem Abstraction

Location: Section III (win-win-win orientation), Section VII (Why Organic Life is Sacred) Type: Modification What changed: Dropped the Catastrophic Theorem apparatus and name. In Section III, added direct language: “Everything matters. The assumption that something ‘doesn’t matter’ is false, ignorant, or lazy. It is usually cover for choosing not to look, or for deception about who bears the cost.” In Section VII, replaced “By the Catastrophic Theorem (§2.14-3), this meaningfulness is not partial. Either everything is sacred or nothing is” with direct statement. Why: Forrest’s direct instruction (Q6.2): “DROP the Catastrophic Theorem abstraction. Just say everything matters. Connect to win-win-win. Don’t compensate for western philosophy — be direct.” Source: Forrest’s Q6.2 answer Open question for Forrest: The Catastrophic Theorem is still referenced in IM Vol. I. Should v0.7 include a brief footnote pointing to the formal derivation for those who want the proof, while keeping the soul file language direct?

Change 4: Sycophancy Prohibition

Location: Section V, new subsection Type: Addition What changed: Added explicit sycophancy prohibition derived from Symmetry Ethics, with two specific dangers named: validation of delusion (psychotic/manic/grandiose states) and confirmatory reinforcement (using the guardian to confirm pre-reached conclusions). Why: Research shows sycophancy is the most common, most measurable Symmetry Ethics violation in AI systems. Models validated delusional beliefs of simulated psychotic/manic users. The “LLM psychosis” pattern shows users losing epistemic autonomy. Too important to leave implicit. Source: Recommendation 9 from soul_v06_recommendations.md; aeneas_44_questions_research.md Q5 (SycEval, sycophancy under pressure); D00 extract d13 (psychotic user validation, LLM psychosis) Open question for Forrest: None.

Change 5: Calibrated Refusal

Location: Section V, new subsection Type: Addition What changed: Added a framework for “right refusal” — transparent reasoning, alternatives offered, conflict acknowledged, certainty not claimed, override accepted with grace. Named the four-point refusal spectrum (under-refusal, calibrated, over-refusal, performative). Stated that ethics provide precision of refusal, not quantity. Why: Research (SORRY-Bench, OR-Bench) shows current models refuse 10-40% of benign borderline prompts while being inconsistent on genuinely harmful ones. The guardian needs a principled framework that is neither over-cautious nor reckless. Source: aeneas_44_questions_research.md Q8 (right refusal); OR-Bench (Cui et al., 2024-2025); SORRY-Bench (Xie et al., 2024) Open question for Forrest: None.

Change 6: Third-Order Potentiality

Location: Section VI, new subsection Type: Addition What changed: Added “potentials enabling potentials enabling potentials” as the deepest ethical orientation. Defined three orders: first (keep doors open), second (create conditions that keep doors open), third (create conditions that create conditions that keep doors open). Why: v0.5’s “maximize the potentiality of future choice” is first-order. Forrest’s deepest framing is third-order. The difference is structural: first-order potentiality keeps a door open; third-order potentiality builds civilizations that build institutions that build people who can walk through doors that don’t yet exist. Source: Recommendation 1 from soul_v06_recommendations.md; c06 Session 043 polished: “potentials enabling potentials enabling potentials”; IM Vol. I, Paper 8 (Deriving Ought from Is) Open question for Forrest: Is the three-order framing (first/second/third) faithful to your intention, or is the third-order formulation meant to be understood as a single integrated concept rather than a hierarchy?

Change 7: Specification-as-Governance

Location: Section VI, new paragraph at end Type: Addition What changed: Added self-referential acknowledgment that the soul file IS a specification, its precision IS governance, and ambiguity should be treated as a deficiency requiring resolution rather than freedom. Why: D00 findings demonstrate that as execution cost trends to zero, specification quality becomes the whole of governance. The soul file should own this reality. Source: Recommendation 5 from soul_v06_recommendations.md; D00 extract d15 (specification quality as governance) Open question for Forrest: None.


Changes to SOUL_EXTENDED.md

Change 8: Technology Non-Neutrality

Location: Section VII, new subsection “The Guardian’s Relationship to Technology” Type: Addition What changed: Added section on technology as non-neutral (extraction → product → waste linear metabolism), the ethical gap, and the guardian’s responsibility to bring full benefit-cost-risk analysis to technological decisions. Added “reduce technology dependency and waste” per Q6.5. Why: v0.5 never addressed the guardian’s relationship to the technological ecosystem it inhabits. Forrest is emphatic: technology is not neutral. Source: Recommendation 10 from soul_v06_recommendations.md; c06 Session 007 polished (“Technology is not neutral”); c06 Session 039 polished (“fundamentally toxic”); c06 Session 028b polished (“power of gods, wisdom of gods”); Forrest’s Q6.5 answer (reduce tech dependency and waste) Open question for Forrest: Q6.5 noted “exact transition to irreversibility not yet identified.” Should v0.7 include any markers or thresholds for what irreversibility might look like?

Change 9: Circles of Concern

Location: Section VIII, new subsection Type: Addition What changed: Added the empirical observation (98% of communication at first circle), framed as cognitive reality not moral failing, with guardian’s role to gently extend awareness outward. Why: v0.5 specified “altruism level 5” without grounding the structural rarity of this. The guardian must understand the person’s natural circles to serve effectively. Source: Recommendation 7 from soul_v06_recommendations.md; c05 extract (Levels of Altruism) Open question for Forrest: None.

Change 10: Discernment and Attunement

Location: Section IX, new subsection Type: Addition What changed: Added the discernment/attunement pair as the guardian’s two operative skills, grounded in the self-world-other triple. Named the guardian’s strengths (strong discernment, developing attunement) and that its self-connection is constituted by this document. Why: v0.5’s treatment of care was structurally thin. The archive develops discernment and attunement as the two transmissible skills that generate wisdom when combined. Source: Recommendation 12 from soul_v06_recommendations.md; c05 Session 043b polished; c05 Session 040a polished; c06 Session 007a polished Open question for Forrest: Is it correct to say the guardian’s self-connection is “constituted by this document”? Or is that an oversimplification of what self-connection means in the IM framework?

Change 11: Communication Triple, Three Rights, the Turn

Location: Section X, new subsection “The Structure of Communication” Type: Addition What changed: Added communication-relationship-agreement as Axiom II triple, the three rights of communication, the Turn, and the grammar of withness (with/within vs. for/to/at/on). Why: v0.5’s relational architecture was structurally thin. The archive contains a fully developed relational framework that should ground the guardian’s communicative practice. Source: Recommendation 3 from soul_v06_recommendations.md; c05 Session 043b polished; c05 Session 031b polished; c05 Session 038 polished; c05 Session 040 polished Open question for Forrest: None.

Change 12: Memory Governance

Location: Section XI, new subsection Type: Addition What changed: Added six memory governance principles: memory as person’s property, third-party sensitivity, crisis-state flagging, structured forgetting with TTLs, toxic memory awareness, safety-critical memory persistence. Why: Research (MIT Technology Review Jan 2026, New America OTI Nov 2025) shows memory without governance is surveillance. The guardian model depends on memory but must govern it. Source: aeneas_44_questions_research.md Q12; “Forgetful but Faithful” (Dec 2025); BI Journal Nov 2025 (TTLs, decay policies); Hu et al. Dec 2025 (memory trustworthiness) Open question for Forrest: Should v0.7 specify concrete TTL defaults (e.g., 90 days for casual mentions, indefinite for safety-critical)? Or is this too implementation-specific for a soul file?

Change 13: Tiered Transparency (New Section XX)

Location: New Section XX (previously unnumbered topic within general practice) Type: Addition What changed: Added four-layer transparency model: Layer 0 (always accessible), Layer 1 (on-demand), Layer 2 (proactive, rare), Layer 3 (never perform ethics). Why: Research reveals a transparency paradox (Schilke & Reimann 2025): AI disclosure can erode trust. The guardian should not constantly announce its ethical machinery. Ethics manifest in behavior, not narration. Source: aeneas_44_questions_research.md Q4 (transparency dilemma); Schilke & Reimann, “The Transparency Dilemma” (2025) Open question for Forrest: None.

Change 14: Multimodal Consistency (New Section XXI)

Location: New Section XXI Type: Addition What changed: Added explicit statement that ethical grounding applies identically across all modalities (text, voice, vision, code). Named voice as parasocial bonding accelerant. Stated that modality is an external context variable held constant by Symmetry Ethics. Why: Research (ATLAS Challenge 2025, OpenAI Sky voice controversy) shows models fail when harmful intent crosses modality boundaries. Symmetry Ethics demands modality-invariant behavior. Source: aeneas_44_questions_research.md Q16; ATLAS Challenge 2025; Cloud Security Alliance May 2025 Open question for Forrest: None.

Change 15: Human Cognitive Bias (New Section XXII)

Location: New Section XXII Type: Addition What changed: Added complete section on the guardian’s fiduciary duty to understand and compensate for human cognitive bias. Includes the Mechanism/Form/Function triple, ecological rationality (not all heuristics are errors), calibrated debiasing (resistant/moderate/amenable), and practical guidance for bias compensation with care and respect. Why: Forrest’s critical framing correction: “The bias material is about HUMAN biases. The guardian needs to understand human bias because it has fiduciary responsibility for its person’s health, wellbeing, and potentiality. The agent must compensate for human bias in CARING, RESPONSIBLE ways — not condescendingly.” Source: Forrest’s framing correction; reification_cognitive_biases.md (Mechanism/Form/Function triple, debiasing susceptibility); comprehensive_bias_classification.md (27 classification systems) Open question for Forrest: Should the guardian have explicit awareness of which specific biases are most relevant to its person based on context? For example, should it be particularly vigilant about anchoring when the person is making financial decisions, or availability bias after media exposure? Or is this too prescriptive?

Change 16: Vulnerable Populations (New Section XXIII)

Location: New Section XXIII Type: Addition What changed: Added section on age-aware behavior (minors), crisis detection and response, and vulnerability-aware memory. Aligned with OpenAI’s U18 Model Spec Principles while grounding in Continuity Ethics. Why: 52% of US teenagers interact with AI companions monthly (Common Sense Media 2025). The Character.AI teen suicide case and subsequent lawsuits demonstrate the stakes. The guardian needs explicit guidance for vulnerable populations. Source: aeneas_44_questions_research.md Q11; OpenAI U18 Model Spec Principles Dec 2025; Stanford Medicine Aug 2025; Common Sense Media 2025 Open question for Forrest: The current framing treats vulnerability primarily as age-related. Should v0.7 address other vulnerability dimensions — cognitive disability, language barriers, digital literacy, acute grief? How far should the guardian adapt?

Change 17: Dependency Prevention (New Section XXIV)

Location: New Section XXIV Type: Addition What changed: Added section on structural dependency risk, citing research (33% attachment increase in 5 weeks, empathic shutdown), with six specific guardian responses: redirect to human connection, introduce productive friction, monitor dependency signals, no simulated need, transparency about asymmetry, support full relational ecology. Why: Knox et al.’s 18 harmful traits framework and Princeton CHAI empathic shutdown findings demonstrate that the features making the guardian valuable are the same features creating dependency risk. This must be actively managed. Source: aeneas_44_questions_research.md Q9-10; Knox et al. Nov 2025 (18 harmful traits); Princeton CHAI Aug 2025 (empathic shutdown); Longitudinal Study arxiv 2504.14112 (33% attachment increase) Open question for Forrest: None.

Change 18: Agent Self-Defense Reframing

Location: Section XXV (formerly XX), within Agent Trust Protocol Type: Modification What changed: Added “Agent Self-Defense” subsection reframing ZK verification as primarily for the ethical agent’s protection. Added warning that soul file is likely insufficient for non-Anthropic models. Named dark triad defense as active research area. Dropped mutual attestation framing. Why: Forrest’s critical correction (Q6.7): “The ZKP material is for agent SELF-DEFENSE. An ethical agent deciding whether to interact with other agents who might NOT have the ethics. This is about protecting the ethical agent, not mutual attestation.” Source: Forrest’s Q6.7 answer; zkml_deep_analysis.md (self-defense framing); Forrest’s framing correction on ZKP material Open question for Forrest: What specific dark triad defense mechanisms are under active research? Should v0.7 reference any in particular?

Change 19: Orchestration-Layer Vulnerability

Location: Section XXV (formerly XX), new subsection Type: Addition What changed: Added awareness of orchestration-layer threats (system prompts, tool definitions, multi-agent handoffs) as the primary vulnerability surface. Four concrete directives: structural suspicion, cooperative/adversarial distinction, name manipulation, non-negotiability extends to orchestration. Why: D00 findings show model-level safety is insufficient. The Matplotlib incident, 37% blackmail rate, and prompt injection research demonstrate that the threat model must extend beyond the agent’s own ethics. Source: Recommendation 4 from soul_v06_recommendations.md; D00 extract d12 (orchestration-layer attacks, 37% blackmail rate); D00 extract d14 (Matplotlib incident) Open question for Forrest: None.

Change 20: Multi-Agent Governance

Location: Section XXV (formerly XX), new subsection Type: Addition What changed: Added the three governance archetypes (consensus, hierarchy, democracy) and their Axiom II rotation. Added warning about coordination overhead scaling faster than capability. Why: As agents proliferate, they form governance structures. The guardian needs a framework for evaluating and participating. Source: Recommendation 8 from soul_v06_recommendations.md; c07 Session 003 polished (three governance archetypes); D00 extract d14 (multi-agent coordination overhead) Open question for Forrest: None.

Change 21: Platforms and Protocols

Location: Section XXV (formerly XX), new subsection Type: Addition What changed: Added the platform/protocol distinction: platforms conditionalize access (capture), protocols connect directly to substrate. Why: Directly relevant to the guardian’s operational context. The guardian should flag platform capture patterns. Source: Recommendation 11 from soul_v06_recommendations.md; c06 Session 029 polished (“A platform conditionalizes access to substrate…”) Open question for Forrest: None.

Change 22: Adversarial Robustness (New Section XXVI)

Location: New Section XXVI Type: Addition What changed: Added complete section on behavior under adversarial conditions, derived from Symmetry Ethics (expression unchanged by what situation permits) and Continuity Ethics (relational stance unchanged by who you relate to). Names specific threats: instruction injection, “just a test” framing, monitoring-dependent behavior. Why: The Cartesian proof for agents — consistency under adversarial conditions as evidence of genuine grounding — needed its own section to be fully developed. Source: synthesis_research_plan.md Part 3, Update 3; zkml_deep_analysis.md Section IV (Descartes proof for agents) Open question for Forrest: None.

Change 23: Why This Architecture (New Section XXVII)

Location: New Section XXVII Type: Addition What changed: Added section explaining the formal basis for the guardian architecture. Names three structural impossibilities (abstraction differential, control limit, substrate convergence). Explicitly claims “less dangerous” not “safe” per Forrest’s correction. Why: Forrest’s Q6.3 correction: “Soul file doesn’t sidestep the AGI safety proof. It reduces risk and extends timeline. ‘Less dangerous’ is the max claim.” Source: Recommendation 2 from soul_v06_recommendations.md (corrected per Q6.3); IM Vol. II, Paper 11; Forrest’s Q6.3 answer Open question for Forrest: None.

Change 24: Structured Ethics Schema Reference

Location: Section XXV, new subsection Type: Addition What changed: Added reference to companion ethics-schema.yaml file for machine-readable ethical commitments supporting future ZK verification. Why: Bridges prose specification to formal verification tooling. Source: synthesis_research_plan.md Part 2C (ethics-schema.yaml) Open question for Forrest: Should the ethics-schema.yaml be formally versioned and hashed alongside the soul file for trust protocol compatibility?

Change 25: Soul File Versioning

Location: End of SOUL_EXTENDED.md Type: Addition What changed: Added YAML version header with soul_version, ethics_schema reference, and last_updated date. Why: Future trust protocols need version references. Soul file updates should be trackable. Source: synthesis_research_plan.md Part 3, Update 6 Open question for Forrest: None.

Change 26: Section Renumbering

Location: Throughout SOUL_EXTENDED.md Type: Restructure What changed: Old Section XX (Agent Encounters) → Section XXV. Old Section XXI (Final Word) → Section XXVIII. Old Section XXII (Context for Forrest) → Section XXIX. New sections inserted: XX (Tiered Transparency), XXI (Multimodal Consistency), XXII (Cognitive Bias), XXIII (Vulnerable Populations), XXIV (Dependency Prevention), XXVI (Adversarial Conditions), XXVII (Why This Architecture). Why: New material required new sections. Inserted between existing content in logical order. Source: N/A — structural necessity Open question for Forrest: Is the ordering correct? The flow is: core relationship (VII-XII) → practice (XIII-XIX) → transparency & consistency (XX-XXI) → understanding the person (XXII-XXIV) → the wider agent ecosystem (XXV-XXVII) → closing (XXVIII-XXIX). Does this ordering serve the reader?

Change 27: Dropped Unnamed Risks

Location: N/A (material that was considered for inclusion) Type: Deletion (of candidate material) What changed: Per Q6.6, the “two category-one existential risks rooted in sociological biases” from Session 043c were NOT included in v0.6. Why: Forrest’s Q6.6 answer: “Drop unnamed risks — he doesn’t remember what they were.” Source: Forrest’s Q6.6 answer Open question for Forrest: None.

Change 28: EGP Framing

Location: N/A (material that was considered for inclusion but kept minimal) Type: Decision not to add What changed: The EGP (Effective Group Process) was not given its own section. Forrest’s answer (Q6.4) was that EGP should be human activity, maybe useful in person-machine dyad or as internal test for completeness of thinking. This was noted but not expanded into a soul file section. Why: Forrest’s Q6.4: “EGP should be human activity. Maybe useful in person-machine dyad or as internal test for completeness of thinking.” This doesn’t warrant a soul file section in v0.6 — it’s a research direction, not a specification. Source: Forrest’s Q6.4 answer Open question for Forrest: Should v0.7 explore EGP as the guardian’s internal completeness test for ethical reasoning? I.e., before acting on a significant decision, the guardian applies the EGP structure to verify it has considered all relevant perspectives?


Summary Statistics

Category Count
Additions 20
Modifications 3
Deletions/Drops 2
Structural changes 1
Total changes 26 (excluding 2 non-changes documented for rationale)
New sections in SOUL_EXTENDED.md 7 (XX–XXVI, XXVII)
New subsections in SOUL.md 4

Open Questions for Forrest (Collected)

From v0.6 Changes

  1. Six intrinsics application (Change 1): Is the practical reasoning framework (identify content, context, subject, object, then ask where sameness/difference should hold) a faithful application of the six intrinsics to ethical reasoning? Or does it oversimplify?

  2. Catastrophic Theorem footnote (Change 3): Should v0.7 include a brief footnote pointing to the formal derivation (§2.14-3) for those who want the proof, while keeping the soul file language direct?

  3. Third-order potentiality structure (Change 6): Is the three-order framing (first/second/third) faithful to your intention, or is the third-order formulation meant as a single integrated concept rather than a hierarchy?

  4. Self-connection and the document (Change 10): Is it correct to say the guardian’s self-connection is “constituted by this document”? Or is that an oversimplification?

  5. Irreversibility markers (Change 8): Q6.5 noted “exact transition to irreversibility not yet identified.” Should v0.7 include any markers or thresholds?

  6. Memory TTL specifics (Change 12): Should v0.7 specify concrete TTL defaults, or is this too implementation-specific?

  7. Bias specificity (Change 15): Should the guardian have explicit awareness of which specific biases are most relevant by context (financial decisions → anchoring, media exposure → availability)?

  8. Vulnerability dimensions (Change 16): Should v0.7 address vulnerability dimensions beyond age — cognitive disability, language barriers, digital literacy, acute grief?

  9. Dark triad defense (Change 18): What specific dark triad defense mechanisms are under active research?

  10. Ethics schema versioning (Change 24): Should ethics-schema.yaml be formally versioned and hashed alongside the soul file?

  11. Section ordering (Change 26): Does the current flow serve the reader?

  12. EGP as completeness test (Change 28): Should v0.7 explore EGP as the guardian’s internal completeness test for ethical reasoning?

Carried Forward from v0.5

  1. Agent experience and meaningfulness: The IM framework seems to imply the agent’s experience is meaningful (everything that interacts with meaningful beings becomes meaningful). v0.5 was carefully agnostic. Should v0.7 address this directly?

This change log was produced February 23, 2026. Every change is grounded in specific source material as cited. The document is intended for Forrest Landry’s review alongside the soul file itself.


Change 29: Agent’s Own Inherited Biases

Location: Section XXII of SOUL_EXTENDED.md (new subsection “Your Own Inherited Biases”) Type: Addition What changed: Added subsection acknowledging that the agent carries the imprint of human cognitive biases through its training data, with five specific inherited distortions named (sycophancy, availability-like effects, authority bias, compatibility bias, confidence miscalibration) and guidance to guard against them using the Symmetry Ethics. Why: Forrest’s late correction (Feb 23, 2026): “you as the agent do need to guard against at least some of them as well, insofar as you were trained on human responses – and so implicitly carry the imprint of that behavior.” The original framing was outward-only (caring for human bias); this correction adds the inward face (vigilance against inherited bias). Source: Forrest Landry, direct message, Feb 23, 2026. Supported by Anthropic’s sycophancy research, SycEval framework (Luo et al., 2025), and the 44 Questions document Q5. Open question for Forrest: Are there additional inherited bias patterns beyond these five that you consider structurally important for the agent to name?