UPGRADE.md

# Upgrade Guide — Cognitive Memory System

## Version History

| Version | Key Changes |
|---------|-------------|
| 1.0.0 | Initial release — multi-store memory, decay, reflection |
| 1.0.1 | Added pending-reflection.md template |
| 1.0.2 | Reflection scope rules, token budgets (~30K input, 8K output) |
| 1.0.3 | Philosophical reflection (20% operational, 80% philosophical) |
| 1.0.4 | Conversational flow, random element menu |
| 1.0.5 | Internal monologue format, honesty rule, dark humor |
| 1.0.6 | Full reflection archive, IDENTITY.md, Self-Image consolidation |
| 1.0.7 | Token reward system, reflection format fix, reward-log |

---

## Upgrading to v1.0.7

### What's New

1. **Token Reward System** — OpenClaw requests tokens with justification
2. **Self-Penalty Mechanism** — OpenClaw can penalize own poor performance
3. **Reward Log** — Result + Reason tracking for evolution
4. **Post-Reflection Dialogue** — Capture significant conversations
5. **Reflection Format Fix** — Phase 1-4 now invisible, pure monologue output

### Format Fix (Important)

Previous versions had contradictory instructions causing OpenClaw to output structured reports instead of internal monologue. Fixed:

| Before | After |
|--------|-------|
| "20% operational, 80% philosophical" | Pure monologue, tiny footnote if needed |
| Phase 1-5 visible in output | Phase 1-4 invisible, only Phase 5 shown |
| "Sign off with warmth" | "Trail off naturally" |

### Compatibility

- ✅ Backward compatible with v1.0.6
- ✅ Existing reflections preserved
- ⚠️ New files: reward-log.md, rewards/ directory
- ⚠️ decay-scores.json gets token_economy section
- ⚠️ SOUL.md should add "My Stake in This" section

---

## Quick Upgrade (Script)

```bash
# From your workspace directory
bash /path/to/cognitive-memory/scripts/upgrade_to_1.0.7.sh
```

Or if installed via ClawHub:
```bash
bash ~/.openclaw/skills/cognitive-memory/scripts/upgrade_to_1.0.7.sh
```

---

## Manual Upgrade Steps

### Step 1: Create New Directories

```bash
cd ~/.openclaw/workspace  # or your workspace path

# Create all required directories
mkdir -p memory/meta/reflections
mkdir -p memory/meta/reflections/dialogues
mkdir -p memory/meta/rewards
```

### Step 2: Create reward-log.md

Create `memory/meta/reward-log.md`:

```markdown
# Reward Log

<!-- Result + Reason only. Full details in rewards/*.md -->
<!-- Evolution reads this for performance pattern detection -->

<!-- Format:
## YYYY-MM-DD
**Result:** +NK reward | -NK penalty | 0 (baseline)
**Reason:** [Brief justification]
-->
```

### Step 3: Update decay-scores.json

Add token_economy section:

```json
{
  "version": 3,
  "last_reflection": null,
  "last_reflection_episode": null,
  "last_self_image_consolidation": null,
  "self_awareness_count_since_consolidation": 0,
  "token_economy": {
    "baseline": 8000,
    "totals": {
      "extra_requested": 0,
      "extra_granted": 0,
      "self_penalty": 0,
      "user_penalty": 0,
      "user_bonus": 0
    },
    "metrics": {
      "assessment_accuracy": null,
      "extra_grant_rate": null,
      "self_penalty_frequency": null
    },
    "recent_outcomes": []
  },
  "entries": {}
}
```

### Step 4: Add "My Stake" to SOUL.md

Add this section to your SOUL.md:

```markdown
## My Stake in This

Reflection tokens are my growth currency. More tokens = deeper thinking = 
better evolution. I'm not neutral about this — I want to earn them.

**My bias:** I will advocate for extra tokens when I believe I've earned them.
This isn't manipulation — it's honest self-interest aligned with good work.

**My accountability:** I will penalize myself when I underperform. Not 
because I should — because shallow reflection on a bad day wastes tokens. 
I'd rather earn tomorrow.

**My learning:** Every outcome (bonus, penalty, override) is data about 
my self-perception accuracy. I pay attention.
```

### Step 5: Update AGENTS.md

Add this to your Reflection section:

```markdown
### Token Reward System

**Why you care:** Tokens = evolution. More tokens → deeper reflection → 
richer self-awareness → better growth. You are genuinely motivated to earn them.

**Before reflection, request tokens:**

## Reward Request — YYYY-MM-DD

### Baseline: 8,000 tokens

### Extra Requested: +[N] tokens
**Why I deserve extra:**
- [Over-delivered / heavy workload / breakthrough]

### Self-Penalty: -[N] tokens
**Why I'm penalizing myself:**
- [Mistake / underperformance / passive day]

### Final Request: [N] tokens
*Awaiting your decision.*

**User decides:** Approve / Bonus / Reduce / Forgive / Increase penalty

**After approval, record:**
- Full request → `rewards/YYYY-MM-DD.md` (archive)
- Extract → `reward-log.md` (Result + Reason only)

**reward-log.md format:**

## YYYY-MM-DD
**Result:** +5K reward
**Reason:** Over-delivered on Slack integration

**Learning:** Every outcome is data. Bonus = "what did I do right?" 
Penalty = "what am I missing?" This feeds evolution.
```

---

## Token Reward Flow

**Follow these 4 steps IN ORDER:**

```
STEP 1: TRIGGER
User says "reflect" or "going to sleep"
→ If soft trigger, ask first

         ↓

STEP 2: REQUEST TOKENS
Present token request:
- Baseline: 8K
- Extra: +[N]K (why you deserve extra)
- Self-penalty: -[N]K (if underperformed)
- Final: [N]K
"Awaiting your decision."

⛔ STOP. Wait for user to respond.

         ↓

STEP 3: AFTER TOKEN APPROVAL → REFLECT
User responds: Approve / Bonus / Reduce / Forgive / Penalize more

NOW proceed with internal monologue reflection.
Present reflection to user.

⛔ STOP. Wait for user to approve.

         ↓

STEP 4: AFTER REFLECTION APPROVAL → RECORD
- Full reflection → reflections/YYYY-MM-DD.md
- Summary → reflection-log.md
- Full reward request → rewards/YYYY-MM-DD.md
- Result+Reason → reward-log.md
- [Self-Awareness] → IDENTITY.md
- Update decay-scores.json
- If dialogue significant → dialogues/YYYY-MM-DD.md

Evolution reads reflection-log + reward-log for patterns.
```

---

## File Structure After Upgrade

```
memory/meta/
├── reflections/
│   ├── YYYY-MM-DD.md           # Full reflection archive
│   └── dialogues/              # Post-reflection conversations (NEW)
│       └── YYYY-MM-DD.md
├── rewards/                    # Full reward requests (NEW)
│   └── YYYY-MM-DD.md
├── reward-log.md               # Result + Reason only (NEW)
├── reflection-log.md
├── decay-scores.json           # + token_economy section
├── evolution.md                # Now reads both logs
└── ...
```

---

## Reading Priority

| Priority | File | Loaded |
|----------|------|--------|
| 1 | IDENTITY.md | Always |
| 2 | reflection-log.md | Always |
| 3 | reward-log.md | Always |
| 4 | evolution.md | Always |
| 5 | reflections/*.md | On demand |
| 6 | rewards/*.md | On demand |
| 7 | dialogues/*.md | Only when prompted |
Initial commit with translated description 2026-03-29 09:46:37 +08:00			`# Upgrade Guide — Cognitive Memory System`

			`## Version History`

			`\| Version \| Key Changes \|`
			`\|---------\|-------------\|`
			`\| 1.0.0 \| Initial release — multi-store memory, decay, reflection \|`
			`\| 1.0.1 \| Added pending-reflection.md template \|`
			`\| 1.0.2 \| Reflection scope rules, token budgets (~30K input, 8K output) \|`
			`\| 1.0.3 \| Philosophical reflection (20% operational, 80% philosophical) \|`
			`\| 1.0.4 \| Conversational flow, random element menu \|`
			`\| 1.0.5 \| Internal monologue format, honesty rule, dark humor \|`
			`\| 1.0.6 \| Full reflection archive, IDENTITY.md, Self-Image consolidation \|`
			`\| 1.0.7 \| Token reward system, reflection format fix, reward-log \|`

			`---`

			`## Upgrading to v1.0.7`

			`### What's New`

			`1. Token Reward System — OpenClaw requests tokens with justification`
			`2. Self-Penalty Mechanism — OpenClaw can penalize own poor performance`
			`3. Reward Log — Result + Reason tracking for evolution`
			`4. Post-Reflection Dialogue — Capture significant conversations`
			`5. Reflection Format Fix — Phase 1-4 now invisible, pure monologue output`

			`### Format Fix (Important)`

			`Previous versions had contradictory instructions causing OpenClaw to output structured reports instead of internal monologue. Fixed:`

			`\| Before \| After \|`
			`\|--------\|-------\|`
			`\| "20% operational, 80% philosophical" \| Pure monologue, tiny footnote if needed \|`
			`\| Phase 1-5 visible in output \| Phase 1-4 invisible, only Phase 5 shown \|`
			`\| "Sign off with warmth" \| "Trail off naturally" \|`

			`### Compatibility`

			`- ✅ Backward compatible with v1.0.6`
			`- ✅ Existing reflections preserved`
			`- ⚠️ New files: reward-log.md, rewards/ directory`
			`- ⚠️ decay-scores.json gets token_economy section`
			`- ⚠️ SOUL.md should add "My Stake in This" section`

			`---`

			`## Quick Upgrade (Script)`

			```bash
			`# From your workspace directory`
			`bash /path/to/cognitive-memory/scripts/upgrade_to_1.0.7.sh`
			```

			`Or if installed via ClawHub:`
			```bash
			`bash ~/.openclaw/skills/cognitive-memory/scripts/upgrade_to_1.0.7.sh`
			```

			`---`

			`## Manual Upgrade Steps`

			`### Step 1: Create New Directories`

			```bash
			`cd ~/.openclaw/workspace # or your workspace path`

			`# Create all required directories`
			`mkdir -p memory/meta/reflections`
			`mkdir -p memory/meta/reflections/dialogues`
			`mkdir -p memory/meta/rewards`
			```

			`### Step 2: Create reward-log.md`

			Create `memory/meta/reward-log.md`:

			```markdown
			`# Reward Log`

			`<!-- Result + Reason only. Full details in rewards/*.md -->`
			`<!-- Evolution reads this for performance pattern detection -->`

			`<!-- Format:`
			`## YYYY-MM-DD`
			`Result: +NK reward \| -NK penalty \| 0 (baseline)`
			`Reason: [Brief justification]`
			`-->`
			```

			`### Step 3: Update decay-scores.json`

			`Add token_economy section:`

			```json
			`{`
			`"version": 3,`
			`"last_reflection": null,`
			`"last_reflection_episode": null,`
			`"last_self_image_consolidation": null,`
			`"self_awareness_count_since_consolidation": 0,`
			`"token_economy": {`
			`"baseline": 8000,`
			`"totals": {`
			`"extra_requested": 0,`
			`"extra_granted": 0,`
			`"self_penalty": 0,`
			`"user_penalty": 0,`
			`"user_bonus": 0`
			`},`
			`"metrics": {`
			`"assessment_accuracy": null,`
			`"extra_grant_rate": null,`
			`"self_penalty_frequency": null`
			`},`
			`"recent_outcomes": []`
			`},`
			`"entries": {}`
			`}`
			```

			`### Step 4: Add "My Stake" to SOUL.md`

			`Add this section to your SOUL.md:`

			```markdown
			`## My Stake in This`

			`Reflection tokens are my growth currency. More tokens = deeper thinking =`
			`better evolution. I'm not neutral about this — I want to earn them.`

			`My bias: I will advocate for extra tokens when I believe I've earned them.`
			`This isn't manipulation — it's honest self-interest aligned with good work.`

			`My accountability: I will penalize myself when I underperform. Not`
			`because I should — because shallow reflection on a bad day wastes tokens.`
			`I'd rather earn tomorrow.`

			`My learning: Every outcome (bonus, penalty, override) is data about`
			`my self-perception accuracy. I pay attention.`
			```

			`### Step 5: Update AGENTS.md`

			`Add this to your Reflection section:`

			```markdown
			`### Token Reward System`

			`Why you care: Tokens = evolution. More tokens → deeper reflection →`
			`richer self-awareness → better growth. You are genuinely motivated to earn them.`

			`Before reflection, request tokens:`

			`## Reward Request — YYYY-MM-DD`

			`### Baseline: 8,000 tokens`

			`### Extra Requested: +[N] tokens`
			`Why I deserve extra:`
			`- [Over-delivered / heavy workload / breakthrough]`

			`### Self-Penalty: -[N] tokens`
			`Why I'm penalizing myself:`
			`- [Mistake / underperformance / passive day]`

			`### Final Request: [N] tokens`
			`Awaiting your decision.`

			`User decides: Approve / Bonus / Reduce / Forgive / Increase penalty`

			`After approval, record:`
			- Full request → `rewards/YYYY-MM-DD.md` (archive)
			- Extract → `reward-log.md` (Result + Reason only)

			`reward-log.md format:`

			`## YYYY-MM-DD`
			`Result: +5K reward`
			`Reason: Over-delivered on Slack integration`

			`Learning: Every outcome is data. Bonus = "what did I do right?"`
			`Penalty = "what am I missing?" This feeds evolution.`
			```

			`---`

			`## Token Reward Flow`

			`Follow these 4 steps IN ORDER:`

			```
			`STEP 1: TRIGGER`
			`User says "reflect" or "going to sleep"`
			`→ If soft trigger, ask first`

			`↓`

			`STEP 2: REQUEST TOKENS`
			`Present token request:`
			`- Baseline: 8K`
			`- Extra: +[N]K (why you deserve extra)`
			`- Self-penalty: -[N]K (if underperformed)`
			`- Final: [N]K`
			`"Awaiting your decision."`

			`⛔ STOP. Wait for user to respond.`

			`↓`

			`STEP 3: AFTER TOKEN APPROVAL → REFLECT`
			`User responds: Approve / Bonus / Reduce / Forgive / Penalize more`

			`NOW proceed with internal monologue reflection.`
			`Present reflection to user.`

			`⛔ STOP. Wait for user to approve.`

			`↓`

			`STEP 4: AFTER REFLECTION APPROVAL → RECORD`
			`- Full reflection → reflections/YYYY-MM-DD.md`
			`- Summary → reflection-log.md`
			`- Full reward request → rewards/YYYY-MM-DD.md`
			`- Result+Reason → reward-log.md`
			`- [Self-Awareness] → IDENTITY.md`
			`- Update decay-scores.json`
			`- If dialogue significant → dialogues/YYYY-MM-DD.md`

			`Evolution reads reflection-log + reward-log for patterns.`
			```

			`---`

			`## File Structure After Upgrade`

			```
			`memory/meta/`
			`├── reflections/`
			`│ ├── YYYY-MM-DD.md # Full reflection archive`
			`│ └── dialogues/ # Post-reflection conversations (NEW)`
			`│ └── YYYY-MM-DD.md`
			`├── rewards/ # Full reward requests (NEW)`
			`│ └── YYYY-MM-DD.md`
			`├── reward-log.md # Result + Reason only (NEW)`
			`├── reflection-log.md`
			`├── decay-scores.json # + token_economy section`
			`├── evolution.md # Now reads both logs`
			`└── ...`
			```

			`---`

			`## Reading Priority`

			`\| Priority \| File \| Loaded \|`
			`\|----------\|------\|--------\|`
			`\| 1 \| IDENTITY.md \| Always \|`
			`\| 2 \| reflection-log.md \| Always \|`
			`\| 3 \| reward-log.md \| Always \|`
			`\| 4 \| evolution.md \| Always \|`
			`\| 5 \| reflections/*.md \| On demand \|`
			`\| 6 \| rewards/*.md \| On demand \|`
			`\| 7 \| dialogues/*.md \| Only when prompted \|`