aaronAI/scripts at b35d44ef587089027ab46d405e673d115a398860 - aaronAI - Aaron's Code

aaron/aaronAI

Files

T

History

aaron b35d44ef58 dream.py: cache the SentenceTransformer embedder across retrieve() calls

Pipeline mode calls retrieve() three times (NREM, Early REM, Late REM).
Previously each call re-imported and re-instantiated SentenceTransformer
("all-MiniLM-L6-v2"), allocating ~200MB and spending 30-60s on disk->CPU
init three times sequentially. lru_cache(maxsize=1) makes the load happen
once per process.

Expected: pipeline runtime drops ~100-180s, removes 2x redundant 200MB
allocations, and reduces transient memory pressure during the same window
when other nightly jobs may run.

2026-05-04 03:11:22 +00:00

..

scripts/: separate production from experimental and deprecated

2026-05-02 23:28:24 +00:00

embeddings: backfill type and created_at (Improvement #2 part A)

2026-05-03 23:58:53 +00:00

api.py

api.py: switch whisper to distil-large-v3, beam_size=1, cpu_threads=4

2026-05-04 01:00:32 +00:00

backup.sh

Update .gitignore, add backup script

2026-04-26 16:21:15 +00:00

corpus_integrity.py

scripts/encoding.py: Stage 1 dual-implementation consolidation (Track 1 Finding 11)

2026-05-03 01:40:47 +00:00

dream.py

dream.py: cache the SentenceTransformer embedder across retrieve() calls

2026-05-04 03:11:22 +00:00

encoding.py

embeddings: enforce type/created_at on writers; manifests carry type_distribution (Improvement #2 part B+C)

2026-05-04 00:15:43 +00:00

failures.py

scripts/encoding.py: Stage 1 dual-implementation consolidation (Track 1 Finding 11)

2026-05-03 01:40:47 +00:00

graphiti_service.py

graphiti_service.py: add traceback logging, log file handler for all endpoints

2026-04-30 17:36:19 +00:00

ingest_conversations.py

embeddings: enforce type/created_at on writers; manifests carry type_distribution (Improvement #2 part B+C)

2026-05-04 00:15:43 +00:00

ingest.py

scripts/encoding.py: Stage 1 dual-implementation consolidation (Track 1 Finding 11)

2026-05-03 01:40:47 +00:00

st_embedder.py

Add SentenceTransformer embedder for Graphiti — self-hosted, no OpenAI dependency

2026-04-27 18:18:37 +00:00

stage2_worker.py

stage2_worker: v2.1 — terminal failure states + sudo path fix

2026-05-01 17:28:53 +00:00

stage3_worker.py

stage3_worker: v2.2 — absolute sudo/systemctl paths, error logging, reset failure counter on recovery failure

2026-05-01 18:40:25 +00:00

watcher.py

scripts/encoding.py: Stage 1 dual-implementation consolidation (Track 1 Finding 11)

2026-05-03 01:40:47 +00:00