api.py: minor cleanups (Track 1 inventory findings)

- Fix /auth/check endpoint that referenced undefined SESSIONS (Phase 1 finding — would NameError 500 on every call). Now uses session_exists(token), the live session-validation mechanism defined elsewhere in api.py. - Remove unused DB_PATH ChromaDB-era constant (paired with the ChromaDB directory deletion and aaronai-maintenance.service removal earlier this session). Found by Track 1 inventory 2026-05-02. Cross-repo verification of share_time (third candidate from the original cleanup proposal) revealed it is working stores-and-returns persistence rather than dead code; share_time intentionally not modified. Inventory document edits are committed separately under the docs/ tracking decision.
api.py: wire up dream_mode setting (Track 1 Finding 9)
2026-05-02 23:59:20 +00:00 · 2026-05-02 23:38:29 +00:00 · 2026-05-02 23:28:24 +00:00 · 2026-05-02 22:56:33 +00:00 · 2026-05-02 21:33:49 +00:00
32 changed files with 18 additions and 8 deletions
@@ -34,7 +34,6 @@ from apscheduler.triggers.cron import CronTrigger
 load_dotenv(Path.home() / "aaronai" / ".env")

 MEMORY_PATH = Path.home() / "aaronai" / "memory.md"
-DB_PATH = str(Path.home() / "aaronai" / "db")
 CONVERSATIONS_DB = str(Path.home() / "aaronai" / "conversations.db")
 SETTINGS_PATH = Path.home() / "aaronai" / "settings.json"
 WATCHER_LOG = str(Path.home() / "aaronai" / "watcher.log")
@@ -50,7 +49,7 @@ DEFAULT_SETTINGS = {
    "show_sources": True,
    "dream_hour_utc": 8,
    "dream_minute_utc": 0,
-    "dream_mode": "nrem",
+    "dream_mode": "pipeline",
    "ingest_hour_utc": 2,
    "ingest_minute_utc": 30,
    "share_time": True,
@@ -382,7 +381,7 @@ async def logout(request: Request, response: Response):
@app.get("/auth/check")
 async def check_auth(request: Request):
    token = get_session(request)
-    if not token or token not in SESSIONS:
+    if not token or not session_exists(token):
        return JSONResponse({"authenticated": False})
    return JSONResponse({"authenticated": True})

@@ -1071,7 +1070,7 @@ async def corpus_retry(request: Request, auth: str = Depends(require_auth)):
            ON CONFLICT (source) DO UPDATE SET
                full_text = EXCLUDED.full_text, char_length = EXCLUDED.char_length,
                enqueued_at = NOW(), completed_at = NULL, failed_at = NULL, attempts = 0
-        """, (source, text[:50000], len(text)))
+        """, (source, text, len(text)))
        cur.execute("""
            UPDATE ingest_failures SET retry_count = retry_count + 1, last_failed_at = NOW()
            WHERE source = %s
@@ -1105,16 +1104,25 @@ async def corpus_reconcile(request: Request, background_tasks: BackgroundTasks,
 scheduler = BackgroundScheduler()

 def run_dream_job():
-    """Runs nightly dreamer — full interdependent pipeline, no mode flag."""
+    """Runs nightly dreamer at the mode set in settings.json (default: pipeline)."""
    try:
        import subprocess
+        settings = load_settings()
+        mode = settings.get("dream_mode", "pipeline")
+        valid_modes = {"pipeline", "nrem", "early-rem", "late-rem"}
+        if mode not in valid_modes:
+            print(f"Dreamer: invalid dream_mode={mode!r}; falling back to pipeline")
+            mode = "pipeline"
        dream_script = str(Path.home() / "aaronai" / "scripts" / "dream.py")
+        cmd = [PYTHON, dream_script]
+        if mode != "pipeline":
+            cmd += ["--mode", mode]
        result = subprocess.run(
-            [PYTHON, dream_script],
+            cmd,
            cwd=str(Path.home() / "aaronai"),
            capture_output=True, text=True, timeout=600
        )
-        print(f"Dreamer completed: {result.stdout[-200:] if result.stdout else 'no output'}")
+        print(f"Dreamer completed (mode={mode}): {result.stdout[-200:] if result.stdout else 'no output'}")
        if result.returncode != 0:
            print(f"Dreamer error: {result.stderr[-200:] if result.stderr else 'unknown'}")
    except Exception as e:
@@ -473,7 +473,9 @@ def dream_pipeline():

    # ── Stage 1: NREM ──────────────────────────────────────────────────────
    print("\n[NREM] Retrieving...")
-    nrem_chunks = retrieve("nrem", excluded_sources=previously_retrieved | session_retrieved)
+    # NREM is replay-and-consolidation — does not exclude prior traces.
+    # Late REM and Early REM exclude prior content for novelty; NREM does not.
+    nrem_chunks = retrieve("nrem", excluded_sources=None)
    session_retrieved.update(c["source"] for c in nrem_chunks)
    # Track sources that scored above Early REM ceiling — these are the only ones Early REM should exclude
    nrem_high_sources = {c["source"] for c in nrem_chunks if c["similarity"] > 0.55}
Author	SHA1	Message	Date
aaron	4b520b2bc2	api.py: minor cleanups (Track 1 inventory findings) - Fix /auth/check endpoint that referenced undefined SESSIONS (Phase 1 finding — would NameError 500 on every call). Now uses session_exists(token), the live session-validation mechanism defined elsewhere in api.py. - Remove unused DB_PATH ChromaDB-era constant (paired with the ChromaDB directory deletion and aaronai-maintenance.service removal earlier this session). Found by Track 1 inventory 2026-05-02. Cross-repo verification of share_time (third candidate from the original cleanup proposal) revealed it is working stores-and-returns persistence rather than dead code; share_time intentionally not modified. Inventory document edits are committed separately under the docs/ tracking decision.	2026-05-02 23:59:20 +00:00
aaron	7bebd8ae50	api.py: wire up dream_mode setting (Track 1 Finding 9) The dream_mode setting was defined in DEFAULT_SETTINGS and watched by update_settings for reschedule, but run_dream_job never read it — silently-ignored configuration. Two changes: 1. DEFAULT_SETTINGS["dream_mode"] flipped from "nrem" to "pipeline". The default was a latent regression vector: wiring up the setting without changing the default would have silently switched all default-config users from full-pipeline (current production behavior) to NREM-only nightly runs. 2. run_dream_job reads dream_mode at fire-time, validates against {"pipeline", "nrem", "early-rem", "late-rem"}, falls back to pipeline with a warning on invalid values. Lucid intentionally excluded — it is on-demand only by design and remains available via CLI and /api/dreamer/run. Nightly dream production behavior is unchanged for current users (no settings.json key → default "pipeline" → no flag passed → same as before). Users can now meaningfully change the nightly mode by editing settings.json or via the SettingsPanel. Found by Track 1 inventory 2026-05-02 (Finding 9 / divergence #9).	2026-05-02 23:38:29 +00:00
aaron	3f7fba7e0e	scripts/: separate production from experimental and deprecated Moves 28 experiment scripts to scripts/experiments/ (E1, E1.4, E1.6, E2, base_class, cascade, cost_test, briefing, consistency, token series). Moves 2 dissolved-layer scripts to scripts/deprecated/ (consolidator_v0_1.py, tier1_migration.py — under the bespoke decision both target retired substrate work). Removes 19 .bak* files from disk (gitignored, never tracked; git history is the durable record of every prior version). The 11 production scripts remain in scripts/. All systemd ExecStart paths, api.py subprocess calls, and cron jobs continue to resolve correctly — verified by grep against /etc/systemd/system/aaronai-*.service, scripts/ references in api.py, and the user crontab. Track 1 inventory cross-cutting finding: scripts/ mixed 11 production files with 32 experimental scripts and ~20 .bak files. After this commit a clean-room reader can identify the live workers from a directory listing alone. Found by Track 1 inventory 2026-05-02. See ~/aaronai/docs/scripts-reorg-plan-2026-05-02.md for full reasoning. After commit, run: 1. git log --oneline -3 — show the new commit on top 2. git status — confirm clean working tree (modulo the docs/ untracked files which are intentional)	2026-05-02 23:28:24 +00:00
aaron	6f2d274d5d	api.py: remove 50KB truncation from /api/corpus/retry (completes F14) The F14 fix on 2026-05-01 removed text[:50000] truncation from watcher.py, ingest.py, and corpus_integrity.py. The retry endpoint in api.py was missed — clicking 'Retry' on an ingest-failed file in the SettingsPanel re-introduced the exact truncation pattern F14 was meant to eliminate. Found by Track 1 inventory 2026-05-02 (Finding 2 / divergence #2).	2026-05-02 22:56:33 +00:00
aaron	7615dedf9e	dream: NREM does not exclude prior traces NREM in the reframe is replay-and-consolidation of recent encoded content. Excluding previously_retrieved sources turns NREM into novelty-finding, which is Late REM's job. NREM should re-traverse already-encoded content; that's what consolidation is. The May 2 abort surfaced this — 52 sources accumulated in the exclusion list, all of them in NREM's similarity band for the recurring research/fabrication/teaching query. The dreamer hit zero retrievable chunks not because the corpus was empty, but because everything semantically aligned was excluded. Late REM and Early REM keep the exclusion mechanism — novelty is their job. Session-scoped exclusion (nrem_high_sources flowing into Early REM) also preserved. The 500/400 trim on retrieved_sources is preserved for the remaining stages that still use it.	2026-05-02 21:33:49 +00:00