From 3529161b0f2dd577bfeee961c8b1fec526c49068 Mon Sep 17 00:00:00 2001 From: Ed_ Date: Sun, 21 Jun 2026 15:13:24 -0400 Subject: [PATCH] conductor(track): add TIER2_STARTER.md for video_analysis_campaign dispatch 3 prompt templates for Tier 2 autonomous agents: 1. Umbrella Tier 2 (Phase 0+1+2 init): installs tooling, builds 5 scripts, scaffolds 12 children 2. Per-child Tier 2 (one child's 5-phase pipeline): Acquire, Keyframes, OCR, Synthesis, Verification 3. Synthesis Tier 2 (after all 12 children): cross-cutting per_video_summary.md + report.md Includes: file-read order, key risks, hard constraints, verification criteria, per-track Tier 2 dispatch commands, and a quick-reference table. --- .../TIER2_STARTER.md | 243 ++++++++++++++++++ 1 file changed, 243 insertions(+) create mode 100644 conductor/tracks/video_analysis_campaign_20260621/TIER2_STARTER.md diff --git a/conductor/tracks/video_analysis_campaign_20260621/TIER2_STARTER.md b/conductor/tracks/video_analysis_campaign_20260621/TIER2_STARTER.md new file mode 100644 index 00000000..700e1a72 --- /dev/null +++ b/conductor/tracks/video_analysis_campaign_20260621/TIER2_STARTER.md @@ -0,0 +1,243 @@ +# Tier 2 Starter Prompt: Video Analysis Campaign + +**Purpose.** This file is the dispatch prompt for Tier 2 autonomous agents picking up tracks in the `video_analysis_campaign_20260621` campaign. It supplements the auto-loaded `spec.md` + `plan.md` per `conductor/tier2/commands/tier-2-auto-execute.md` step 2. + +**Two prompt templates below:** +1. **Umbrella Tier 2** — for Phase 0 (tooling) + Phase 1 (5 scripts) + Phase 2 initialization (12 child tracks scaffolded with plan.md/metadata.json/state.toml). +2. **Per-child Tier 2** — for executing one child's 5-phase pipeline (Acquire → Keyframes → OCR → Synthesis → Verification). + +--- + +## Template 1: Umbrella Tier 2 (Phases 0 + 1 + 2 init) + +``` +Dispatch Tier 2 with: /tier-2-auto-execute video_analysis_campaign_20260621 + +Plus this context (paste BEFORE invoking): + +--- +TRACK: video_analysis_campaign_20260621 +TYPE: Multi-track research campaign (1 umbrella + 12 children + 1 synthesis = 14 folders) +STATUS: spec_approved; awaiting Phase 0 (tooling prerequisites) +PRIORITY: A (user-blocking research campaign) + +PASS 1 OF 3 (multi-pass — load-bearing framing): +- Pass 1 (THIS): information extraction + distillation → 12 deep-dive reports + cross-cutting synthesis +- Pass 2 (FUTURE, USER-led): de-obfuscation via user's math encoding notation. USER must rediscover the encoding before Pass 2 starts. +- Pass 3 (FUTURE, USER-led): projection to user's applied domain. USER must articulate "own caveats" before Pass 3 starts. +- CRITICAL: Pass 1 artifacts MUST be lossless. Per-video target: 1000-10000 LOC markdown. Over-summarization here is data loss that cascades. + +FILES TO READ IN THIS ORDER (do not skip): + +1. /TIER2_STARTER.md (this file) +2. ./spec.md (full design — 15 sections, ~600 lines) +3. ./plan.md (Phase 0+1 bite-sized tasks; Phase 2-4 brief pointers) +4. ./metadata.json (scope, risk_register, verification_criteria, user_directives) +5. ./state.toml (current_phase, task tracking) +6. ./README.md (child index) + +THEN at session start (per conductor/workflow.md Standard Task Workflow): +7. /AGENTS.md (critical anti-patterns, file naming, no day estimates, skip-marker policy) +8. /conductor/workflow.md (task workflow, Tier 2 sandbox conventions, failcount contract) +9. /conductor/code_styleguides/python.md (1-space indent, type hints, no comments) +10. /conductor/code_styleguides/error_handling.md (Result[T] pattern for new scripts) + +REFERENCE SCRIPTS (consult as needed, DO NOT import): +- C:/projects/forth/bootslop/download_videos.py (yt-dlp usage) +- C:/projects/forth/bootslop/extract_frames.py (cv2 + imagehash) +- C:/projects/forth/bootslop/process_visuals.py (winsdk OCR + visual heuristics) +- C:/projects/forth/bootslop/ocr_interaction.py (standalone OCR) + +KEY RISKS (from metadata.json risk_register): +- R1 + R10 (HIGH, verified 2026-06-21): yt-dlp, cv2, imagehash, pillow NOT in repo venv. Phase 0 prerequisite. +- R5 (CONFIRMED for 2 videos): 9vM4p9NN0Ts, lVynu4bo1rY failed oEmbed 401. yt-dlp may still work; verify in Phase 1 of those child tracks. +- R7 (MEDIUM): Pass 1 over-summarization loses signal for Pass 2. Enforce 1000-10000 LOC floor per child report. +- R8 (MEDIUM): Tier 2 capacity for 12+ child tracks — each child is independently shippable; the campaign is async. + +HARD CONSTRAINTS: +- NO day/hour/minute estimates in any artifact. Scope measured in files/sites only. +- NO src/*.py changes. NO new pyproject.toml deps beyond the 4 packages installed in Phase 0. +- NO comments in source code. Documentation lives in /docs. +- 1-space indent on all Python. Type hints on all public functions. +- All new scripts follow Result[T] convention per /conductor/code_styleguides/error_handling.md. +- Test runner: uv run python scripts/run_tests_batched.py (NEVER uv run pytest directly). + +VERIFICATION CRITERIA (gate for campaign completion): +- All 12 child tracks shipped with report.md (1000-10000 LOC) + summary.md (200-400 words) + artifacts/ +- Synthesis track shipped with per_video_summary.md + report.md +- 5 scripts in scripts/video_analysis/ with passing TDD tests +- End-of-track report at docs/reports/TRACK_COMPLETION_video_analysis_campaign_20260621.md +- state.toml updated to status = "completed" + +EXECUTION PLAN: +- Phase 0: 4 install tasks (yt-dlp, cv2/imagehash/PIL, OCR backend, scripts/ namespace scaffold) +- Phase 1: 5 scripts with TDD (delegate each to Tier 3 worker via mma_exec.py --role tier3-worker) +- Phase 2: Initialize each child track (plan.md + metadata.json + state.toml) — 12 tracks total. Per-child 5-phase pipeline execution is a SEPARATE Tier 2 dispatch per child (see Template 2). +- Phase 3: Synthesis track (blocked by all 12 children). Initialize + dispatch Tier 3 for cross-cutting report. +- Phase 4: Closeout — update umbrella README.md, write end-of-track report, move 14 folders to archive/, update chronology.md. + +WHEN STUCK: +- Multi-pass question? Re-read spec.md §0 + §11. +- Tooling question? Reference bootslop scripts (don't import). +- Style question? Check /conductor/code_styleguides/ + AGENTS.md. +- Per-child question? Read the child spec.md for that slug. +- State/plan question? Update plan.md and state.toml atomically per the per-task commit protocol in /conductor/tier2/agents/tier2-autonomous.md. +--- +``` + +--- + +## Template 2: Per-Child Tier 2 (one child's 5-phase pipeline) + +After Template 1 completes, dispatch a new Tier 2 per child track: + +``` +Dispatch Tier 2 with: /tier-2-auto-execute video_analysis__20260621 --resume + +Where is one of: +- cs229_building_llms (E, video #1) +- probability_logic (A, video #2) +- entropy_epiplexity (A, video #3) +- score_dynamics_giorgini (A, video #4) +- platonic_intelligence_kumar (B, video #5) +- free_lunches_levin (B, video #6) +- generic_systems_fields (C, video #7) +- brain_counterintuitive (C, video #8) +- neural_dynamics_miller (C, video #9) +- multiscale_hoffman (C, video #10) +- cs336_architectures (E, video #11) +- creikey_dl_cv (D, video #12) + +Plus this context (paste BEFORE invoking): + +--- +TRACK: video_analysis__20260621 +TYPE: Per-child research track (one of 12 in the video_analysis_campaign_20260621 umbrella) +STATUS: spec ✓ (scaffolded by umbrella Tier 2); plan ✓ + metadata ✓ + state ✓ (created by umbrella Tier 2 in Phase 2 init) + +PASS 1 OF 3 (multi-pass campaign — load-bearing): +- This child track produces raw artifacts (transcript.json, frames/, ocr.md) + a 1000-10000 LOC report.md + a 200-400 word summary.md. +- These artifacts feed Pass 2 (de-obfuscation) and Pass 3 (projection). They MUST be lossless. +- DO NOT over-summarize. The Tier 3 worker prompt must specify "1000-10000 LOC" target. + +FILES TO READ IN THIS ORDER: + +1. ./spec.md (lightweight — references umbrella, gives video details, specifies 7 deliverables + 5-phase pipeline + 8-section report structure) +2. /conductor/tracks/video_analysis_campaign_20260621/TIER2_STARTER.md (this parent file — for cross-track context) +3. /conductor/tracks/video_analysis_campaign_20260621/spec.md (full umbrella design) +4. /conductor/tracks/video_analysis_campaign_20260621/plan.md (campaign-level plan) +5. /conductor/tracks/video_analysis_campaign_20260621/README.md (child index — confirm this is the right child) + +THEN at session start (if first Tier 2 invocation in this session): +6. /AGENTS.md +7. /conductor/workflow.md +8. /conductor/code_styleguides/python.md +9. /conductor/code_styleguides/error_handling.md + +PIPELINE (5 phases per umbrella spec §FR5): + +Phase 1: Acquire +- Run scripts/video_analysis/extract_transcript.py /artifacts/transcript.json +- Run scripts/video_analysis/download_video.py /artifacts/video.mp4 (unless skip_video_download=true) +- For E-cluster children (cs229_building_llms, cs336_architectures): yt-dlp may fail per R5 — if so, fall back to manual transcript sourcing if available, or escalate. +- Commit artifacts atomically. + +Phase 2: Keyframes +- Run scripts/video_analysis/extract_keyframes.py