aaronAI

aaron b35d44ef58 dream.py: cache the SentenceTransformer embedder across retrieve() calls

Pipeline mode calls retrieve() three times (NREM, Early REM, Late REM).
Previously each call re-imported and re-instantiated SentenceTransformer
("all-MiniLM-L6-v2"), allocating ~200MB and spending 30-60s on disk->CPU
init three times sequentially. lru_cache(maxsize=1) makes the load happen
once per process.

Expected: pipeline runtime drops by ~60-120s (two of the three 30-60s loads
are skipped), the two redundant ~200MB allocations go away, and transient
memory pressure falls during the window when other nightly jobs may run.
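
A minimal sketch of the caching pattern described above, not the actual dream.py code. The names get_embedder, FakeEmbedder, and the retrieve() signature are assumptions made for illustration. FakeEmbedder stands in for SentenceTransformer so the sketch runs without downloading the model.

```python
from functools import lru_cache


class FakeEmbedder:
    """Hypothetical stand-in for SentenceTransformer (keeps the sketch runnable)."""

    loads = 0  # counts how many times the "expensive" init ran

    def __init__(self, model_name: str):
        FakeEmbedder.loads += 1
        self.model_name = model_name

    def encode(self, texts):
        # Toy embedding: one number per text. The real model returns dense vectors.
        return [[float(len(t))] for t in texts]


@lru_cache(maxsize=1)
def get_embedder(model_name: str = "all-MiniLM-L6-v2"):
    # In the real script the body would presumably be closer to:
    #   from sentence_transformers import SentenceTransformer
    #   return SentenceTransformer(model_name)
    return FakeEmbedder(model_name)


def retrieve(query: str, stage: str):
    # Hypothetical signature: each pipeline stage asks for the shared embedder.
    embedder = get_embedder()  # constructed on the first call, reused afterwards
    return stage, embedder.encode([query])[0]


# Pipeline mode: three retrieve() calls, one model load.
results = [retrieve("what happened today", s) for s in ("NREM", "Early REM", "Late REM")]
print(FakeEmbedder.loads, get_embedder.cache_info())
```

One caveat with this pattern: lru_cache keys on the arguments as passed, so get_embedder() and get_embedder("all-MiniLM-L6-v2") are separate cache entries, and with maxsize=1 alternating between them would evict and reload each time. Calling it the same way everywhere avoids that.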

2026-05-04 03:11:22 +00:00

deprecated

chore: archive deprecated chromadb and migration scripts

2026-04-28 00:15:46 +00:00

docs

docs/inventory: layer 2026-05-03 updates (resolutions, corrections, new findings)

2026-05-03 20:32:55 +00:00

experiments

embeddings: backfill type and created_at (Improvement #2 part A)

2026-05-03 23:58:53 +00:00

scripts

dream.py: cache the SentenceTransformer embedder across retrieve() calls