From d28e46e4b0fdc91563aa87f5385098de117ef782 Mon Sep 17 00:00:00 2001 From: Ed_ Date: Tue, 23 Jun 2026 20:00:05 -0400 Subject: [PATCH] conductor(deob_lexicon_v2): Initialize v2 track scaffold + V2_CHANGELOG The corrective pass track is initialized with: - spec.md: 14 sections, 8 corrections + 3 refinements + 4 template notations + 2 << >> placements - plan.md: 7 atomic tasks, per-file commits with git notes - metadata.json: scope, verification criteria, risk register, user-directives-logged - state.toml: 2 phases, 7 + 2 tasks - README.md: track index + cross-references - V2_CHANGELOG.md: 17 v1->v2 changes documented + out-of-scope items The 5 source files (lexicon.md, terms_catalog.md, dedup_map.md, prompt_template.md, report.md) are NOT yet modified; this commit is the track scaffold + changelog. The 5 source file changes follow in separate commits. --- .../README.md | 84 +++++++ .../V2_CHANGELOG.md | 195 ++++++++++++++++ .../metadata.json | 144 ++++++++++++ .../plan.md | 121 ++++++++++ .../spec.md | 216 ++++++++++++++++++ .../state.toml | 73 ++++++ 6 files changed, 833 insertions(+) create mode 100644 conductor/tracks/video_analysis_deob_lexicon_v2_20260623/README.md create mode 100644 conductor/tracks/video_analysis_deob_lexicon_v2_20260623/V2_CHANGELOG.md create mode 100644 conductor/tracks/video_analysis_deob_lexicon_v2_20260623/metadata.json create mode 100644 conductor/tracks/video_analysis_deob_lexicon_v2_20260623/plan.md create mode 100644 conductor/tracks/video_analysis_deob_lexicon_v2_20260623/spec.md create mode 100644 conductor/tracks/video_analysis_deob_lexicon_v2_20260623/state.toml diff --git a/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/README.md b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/README.md new file mode 100644 index 00000000..0aa9c607 --- /dev/null +++ b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/README.md @@ -0,0 +1,84 @@ +# Lexicon v2 Patch — Track Index + +**Track:** `video_analysis_deob_lexicon_v2_20260623` +**Type:** Corrective pass (Pass 2 Phase 1.5 — between Pass 2 closeout and Pass 3 init) +**Status:** spec DRAFT (pending user review) +**Date:** 2026-06-23 + +## What this track is + +After Pass 2 SHIPPED (35 atomic commits, 33 deliverables + 2 reports), the user reviewed the lexicographic substrate and surfaced **11 corrections + 15 design refinements** that the v1 lexicon encoded incorrectly. This track produces a **v2** of the lexicon substrate: 5 source files updated + 1 changelog. Targeted, not exhaustive. + +## Track files + +| File | Purpose | +|---|---| +| `spec.md` | The design intent: 14 sections, the 8 corrections, the 3 refinements, the 4 template notations, the 2 `<<` / `>>` placements, the architecture reference, the out-of-scope items, the risk register, the verification criteria | +| `plan.md` | The TDD-ready plan: 7 atomic tasks + 2 verification tasks, 1 commit per file change | +| `metadata.json` | The machine-readable metadata: scope (5 modified + 6 new), verification criteria, v1→v2 changes table, risk register, user-directives-logged (19 entries) | +| `state.toml` | The track state: 2 phases, 7 + 2 tasks, user-directives-logged (20 entries) | +| `V2_CHANGELOG.md` | The 1-line per change changelog (created in Task 1.6) | +| `REPORT_V2_CHANGELOG.md` | (optional; created if the v2 of `report.md` is a separate file instead of in-place) | + +## Source files modified (in upstream folders) + +| File | Where | v1 → v2 changes | +|---|---|---| +| `lexicon.md` | `video_analysis_deob_lexicon_20260621/` | 8 corrections + 3 refinements + 4 template notations + 2 `<<` / `>>` placements + new §10 + §1.5 encoding placeholder | +| `terms_catalog.md` | `video_analysis_deob_lexicon_20260621/` | Tier 1-4 rows updated; 4 new rows added; stats updated | +| `dedup_map.md` | `video_analysis_deob_lexicon_20260621/` | Maps 1, 2, 3 reshaped; Map 4 unchanged | +| `prompt_template.md` | `video_analysis_deob_warmup_20260621/` | Rule 5 encoding placeholder; 3 wrong re-encodings removed; per-language `<<` / `>>` note added | +| `report.md` | `video_analysis_deob_warmup_20260621/` | §1.13 encoding placeholder; §3 lexicon v2; §9 operator reference footnote; new §10 per-language rendering | + +## What v2 fixes (the 17 changes) + +| ID | Summary | Source file | +|---|---|---| +| L1 | Remove `set → kind` re-encoding | `lexicon.md` Tier 1 + `terms_catalog.md` + `dedup_map.md` Map 2 | +| L2 | Remove `function → procedure` re-encoding | `lexicon.md` Tier 2 + Tier 4 + `terms_catalog.md` + `dedup_map.md` Map 3 | +| L3 | Remove `parameter → argument` re-encoding | `lexicon.md` Tier 2 + `terms_catalog.md` | +| L4 | Remove `input → arg` re-encoding | `lexicon.md` Tier 2 + `terms_catalog.md` | +| L5 | Remove `proof → construction` re-encoding | `lexicon.md` Tier 2 + `dedup_map.md` Map 1 | +| L6 | Replace `transcendental → template expression` | `lexicon.md` Tier 4 + `terms_catalog.md` | +| L7 | Change encoding default to `Scalar` / `floatX` placeholder | `lexicon.md` §1.5 + §7 + `prompt_template.md` Rule 5 | +| L8 | Reconcile `Type` / `Kind`; reserve `kind` for enums | `lexicon.md` Tier 1 + Tier 3 + `terms_catalog.md` | +| R1 | Add `correlation` to encoding-explicit examples | `lexicon.md` §7.3 + `terms_catalog.md` | +| R4 | Add `Markov chain` as a type-class | `lexicon.md` + `terms_catalog.md` | +| R6 | Add `PolyTimeAdversary` as a type-class | `lexicon.md` + `terms_catalog.md` | +| TN1 | Add template notation B as default | `lexicon.md` Tier 3 + `terms_catalog.md` + `prompt_template.md` | +| TN2 | Add template notation C++ as opt-in | `lexicon.md` Tier 3 + `terms_catalog.md` | +| TN3 | Add template notation Odin as opt-in | `lexicon.md` Tier 3 + `terms_catalog.md` | +| TN4 | Document template notation selection rule | `lexicon.md` Tier 3 + `prompt_template.md` | +| <<1 | Add `<<` / `>>` to Tier 1 (comparison) | `lexicon.md` Tier 1 + `terms_catalog.md` | +| <<2 | Add `<<` / `>>` to Tier 4 (fuzzy with tolerance) | `lexicon.md` Tier 4 + `terms_catalog.md` | +| <<3 | Document `<<` / `>>` per-language rendering | `lexicon.md` §10 (NEW) + `prompt_template.md` | + +## What v2 does NOT do (out of scope) + +- **33 deliverables + 2 reports** (`apply/` folder) — NOT re-processed. They are intermediate artifacts; Pass 3 will use v2 to produce C11/Python code directly. +- **5 DEFERRED gaps** (G1, G2, G4, G7, G9) — not addressed in v2; deferred to lexicon v3. +- **3 INDEFINITE gaps** (G6, G7, G8 — note G7 is shared with DEFERRED list; the INDEFINITE version of G7 is the multiscale_hoffman one, not the one in the warmup §A.3) — preserved with honest epistemic hedging. +- **9 FIX refinements** (5 PILOT FIX + 4 APPLY FIX) — already in the deliverables; no change in v2. +- **Pass 3** (the projection to C11/Python code) — separate track; this corrective pass is its prerequisite. +- **Sectored Language V1 adaptation for code** — Pass 3 concern. + +## Sequence + +1. **Phase 1** (Tasks 1.1-1.7): Apply v1→v2 changes to the 5 source files; write V2_CHANGELOG.md; update tracks.md. 7 atomic commits. +2. **Phase 2** (Tasks 2.1-2.2): Manual review of the 17 changes; user review + approval. + +## Next track + +After v2 SHIPPED + user approval: **`video_analysis_deob_pass3_20260623/`** (Pass 3 — the projection to C11/Python code). + +## Cross-references + +- **Upstream:** `video_analysis_deob_apply_20260621/` (Phase 3, SHIPPED 2026-06-23) +- **Downstream:** `video_analysis_deob_pass3_20260623/` (future, user-led) +- **Design doc:** `video_analysis_deob_warmup_20260621/report.md` (modified by this track to v2) +- **Codified spec:** `video_analysis_deob_lexicon_20260621/lexicon.md` (modified by this track to v2) +- **LLM-direct spec:** `video_analysis_deob_warmup_20260621/prompt_template.md` (modified by this track to v2) + +--- + +*End of `README.md`. 5 sections. The track is targeted and not exhaustive.* diff --git a/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/V2_CHANGELOG.md b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/V2_CHANGELOG.md new file mode 100644 index 00000000..18aa2984 --- /dev/null +++ b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/V2_CHANGELOG.md @@ -0,0 +1,195 @@ +# V2_CHANGELOG.md — Lexicon v2 Transition Log + +**Track:** `video_analysis_deob_lexicon_v2_20260623` +**Date:** 2026-06-23 +**Type:** Documented v1→v2 transition (17 changes + 1 encoding refinement) + +> **Purpose.** This changelog documents the v1→v2 transition for the lexicon substrate. The v1 state is preserved in git history; the v2 is the current state. The 33 Pass 2 deliverables + 2 reports are intermediate artifacts and are NOT re-processed (Pass 3 will use v2 to produce C11/Python code directly). +> +> **Reading guide.** Each change is documented with: ID, summary, source file, action. The 17 changes are: 8 corrections (L1-L8) + 3 refinements (R1, R4, R6) + 4 template notations (TN1-TN4) + 2 `<<` / `>>` placements (<<1, <<2) + 1 per-language rendering section (<<3). The L7 entry has a sub-refinement for the placeholder scheme (L7a: encoding defaults). + +--- + +## §1. The 8 corrections (L1-L8) + +### L1: Remove `set → kind` re-encoding + +- **User directive (2026-06-23):** "Set is a specific data structure, not directly related to an enumerable type. Do not reencode set as a kind." +- **Source file:** `lexicon.md` Tier 1 #1.1 + `terms_catalog.md` Tier 1 row 1.1 + `dedup_map.md` Map 2 + `report.md` §3.1 +- **Action:** REMOVE the re-encoding. The principled form is "no re-encoding; clarify with native language + etymology." The data structure `set` (HashSet, SortedSet, etc.) is distinct from the type-theoretic `kind` (reserved for enumeration types per L8). + +### L2: Remove `function → procedure` re-encoding + +- **User directive (2026-06-23):** "A procedure is not necessarily a function, a function has a specific declarative sense it, a procedure is imperative." +- **Source file:** `lexicon.md` Tier 2 #2.1 + Tier 4 #4.4 + `terms_catalog.md` Tier 2 row 2.1 + Tier 4 row 4.4 + `dedup_map.md` Map 3 + `report.md` §3.2 + §3.4 +- **Action:** REMOVE the re-encoding. Function (declarative, math) and procedure (imperative, CS) are distinct concepts. The principled form is to keep `function` and clarify with native language + etymology. + +### L3: Remove `parameter → argument` re-encoding + +- **User directive (2026-06-23):** "Parameters and arguments are also distinctly separate terms they are not analogous. ... Better to say that the term doesn't need deobfuscation and instead just clarify what it is in native language with associate etymology or similar verb in a different lang." +- **Source file:** `lexicon.md` Tier 2 #2.2 + `terms_catalog.md` Tier 2 row 2.2 + `report.md` §3.2 +- **Action:** REMOVE the re-encoding. Parameter (formal name in definition) and argument (actual value passed) are distinct concepts. The principled form is to keep `parameter` and clarify with native language + etymology. + +### L4: Remove `input → arg` re-encoding + +- **User directive (2026-06-23):** "inpug and arg are also not the same." +- **Source file:** `lexicon.md` Tier 2 #2.5 + `terms_catalog.md` Tier 2 row 2.5 + `report.md` §3.2 +- **Action:** REMOVE the re-encoding. Input (conceptual act) and arg (formal name) are distinct concepts. The principled form is to keep `input` and clarify with native language + etymology. + +### L5: Remove `proof → construction` re-encoding (use construction as sub-type tag) + +- **User directive (2026-06-23):** "Proof and construction maybe, a construction is a type of proof." +- **Source file:** `lexicon.md` Tier 2 #2.9 + `terms_catalog.md` Tier 2 row 2.9 + `dedup_map.md` Map 1 + `report.md` §3.2 +- **Action:** REMOVE the re-encoding. `construction` is a sub-type tag applied when the proof is constructive; the main term remains `proof`. Map 1 example "we have a proof that x² ≥ 0" now uses `proof` with `construction` as a sub-type tag. + +### L6: Replace `transcendental → template expression` with classification form + +- **User directive (2026-06-23):** "A transcendental number ... is a classification of expressions that resolve to a specific sequence that is consistent for the encoding resolution, and fullfills very specific traits. An algebraic spexpression that fullfills the term for irrationals also shares similar traits but is mising some that transcendental has, etc." +- **Source file:** `lexicon.md` Tier 4 #4.9 + `terms_catalog.md` Tier 4 row 4.9 + `report.md` §3.4 +- **Action:** REPLACE the re-encoding. The new form: `classification of expressions that resolve to a specific sequence consistent with the encoding resolution, fulfilling very specific traits (transcendence over algebraic); an algebraic expression that fulfills the term for irrationals shares some but not all traits`. + +### L7: Change encoding default from `float64` to placeholder scheme + +- **User directive (2026-06-23):** "Defining a Scalar as an undefined resolution is fine. you don't need todo float64 it can just floatX when the X defines a placeholder until the user defines a resolution for a target application. Or just call it Scalar. Int64 same thing, just call it integer until resolution definition is necessary. ... I do like the encoding taxonomy table you have when picking a resolution matters though." +- **User directive (refined 2026-06-23):** "no keep scalar its useful for linear alg, geo alg, tensor alg." +- **Source file:** `lexicon.md` §1.5 + §7 + Tier 4 #4.2, 4.19, 4.20, 4.21, 4.22 + `terms_catalog.md` Tier 4 rows + `prompt_template.md` Rule 5 + `report.md` §1.13 + §3.4 +- **Action:** CHANGE the principled default. The v1 default `float64` is replaced with the placeholder scheme: + - `float` — general-purpose unbounded float placeholder + - `integer` — general-purpose unbounded integer placeholder + - `Scalar` — domain-specific placeholder for linear alg, geo alg, tensor alg (per user 2026-06-23, "useful for linear alg, geo alg, tensor alg") + - `float64` is the principled resolved default ONLY when the user defines a target resolution + + The encoding taxonomy table is preserved (per user 2026-06-23, "I do like the encoding taxonomy table you have when picking a resolution matters though"). The placeholder vs resolved distinction makes the taxonomy *more* useful, not less. + +### L8: Reconcile `Type` / `Kind`; reserve `kind` for enumeration types + +- **User directive (2026-06-23):** "Type/Genus/Kind are analogous no need to rencode. Type and Type of types is analogous to kind. Kind is useful term to reserve for enumeration types, especially when used with components, DAG nodes, or fat structs." +- **Source file:** `lexicon.md` Tier 1 #1.1 (set) + Tier 2 #2.11, 2.12, 2.13 + Tier 3 #3.1 + `terms_catalog.md` Tier 1 + Tier 2 + Tier 3 + `report.md` §3.1 + §3.2 + §3.3 +- **Action:** RECONCILE. The v1 lexicon re-encoded `Type → kind` and `Type/Genus → kind (sense 8)`. v2 removes these collapses: + - `Type` is the conventional term; no re-encoding. + - `Kind` (capital K) is the type-of-types term (standard). + - `kind` (lowercase) is reserved for enumeration types: components, DAG nodes, fat structs. + + The `Attribute` / `Property` / `Type/Genus` re-encodings to lowercase forms are also removed; the principled form is to keep the conventional forms and document the `kind` reservation. + +--- + +## §2. The 3 DEFERRED refinements (R1, R4, R6) + +### R1: Add `correlation` to encoding-explicit examples + +- **Source:** cs229 §2.6 "98% correlation" (from `pilot_report.md` §3.1 + `apply_report.md` §4 R1) +- **Source file:** `lexicon.md` §7.3 + `terms_catalog.md` Tier 4 #4.25 + `report.md` §3.4 +- **Action:** ADD a new entry. Re-encoded: `correlation : ` where `` is `float` (placeholder) or `float64` (resolved). Encoding taxonomy: `float` for general placeholder; `float64` for resolved. + +### R4: Add `Markov chain` to Tier 3 as a type-class + +- **Source:** entropy_epiplexity §5.2 (from `pilot_report.md` §3.4 + `apply_report.md` §4 R4) +- **Source file:** `lexicon.md` Tier 3 #3.19 + `terms_catalog.md` Tier 3 row 3.19 + `report.md` §3.3 +- **Action:** ADD a new entry. Re-encoded: `Markov where X -> Y -> Z is a Markov chain`. The 3-argument form is unusual but the de-obfuscation needs it (per the apply phase). + +### R6: Add `PolyTimeAdversary` to Tier 3 as a type-class + +- **Source:** entropy_epiplexity §5.8 (from `pilot_report.md` §3.6 + `apply_report.md` §4 R6) +- **Source file:** `lexicon.md` Tier 3 #3.20 + `terms_catalog.md` Tier 3 row 3.20 + `report.md` §3.3 +- **Action:** ADD a new entry. Re-encoded: `PolyTimeAdversary : Type where forall A : PolyTimeAdversary, runtime(A) : Polynomial(security_parameter) : int64`. Etymology: Goldwasser-Micali 1982. + +--- + +## §3. The 4 template notations (TN1-TN4) + +### TN1: Add template notation B as default + +- **User directive (2026-06-23):** "B as default. Use c+++ template language style in specific scenario when the user asks for it or if its better (same with jai/odin language have the lexicon support it if the user desires it)." +- **Source file:** `lexicon.md` Tier 3 #3.14 + `terms_catalog.md` Tier 3 row 3.14 + `prompt_template.md` + `report.md` §3.3 +- **Action:** ADD notation. Form: `Dependent(B) <- depends(x : A)`. This is the principled default per user 2026-06-23. The intent scripting DSL style with explicit dependency declaration. + +### TN2: Add template notation C++ as opt-in + +- **User directive (2026-06-23):** "Use c+++ template language style in specific scenario when the user asks for it or if its better." +- **Source file:** `lexicon.md` Tier 3 #3.14 + `terms_catalog.md` Tier 3 row 3.14 + `report.md` §3.3 +- **Action:** ADD notation. Form: `Dependent`. Opt-in for C++ context; the user requests this when the C++ convention is preferred. + +### TN3: Add template notation Odin as opt-in + +- **User directive (2026-06-23):** "same with jai/odin language have the lexicon support it if the user desires it" +- **Source file:** `lexicon.md` Tier 3 #3.14 + `terms_catalog.md` Tier 3 row 3.14 + `report.md` §3.3 +- **Action:** ADD notation. Form: `Dependent[B, x : A]`. Opt-in for Odin context; the user requests this when the Odin convention is preferred. + +### TN4: Document template notation selection rule + +- **User directive (2026-06-23):** "B as default. Use c+++ template language style in specific scenario when the user asks for it or if its better (same with jai/odin language have the lexicon support it if the user desires it)." +- **Source file:** `lexicon.md` Tier 3 #3.14 + `terms_catalog.md` Tier 3 row 3.14 + `prompt_template.md` + `report.md` §3.3 +- **Action:** DOCUMENT the selection rule. "Use B as default. C++ / Odin / Jai are opt-in per context; the user requests them when their convention is preferred." + +--- + +## §4. The 2 `<<` / `>>` placements + 1 per-language rendering section (<<1, <<2, <<3) + +### <<1: Add `<<` / `>>` to Tier 1 (comparison operators) + +- **User directive (2026-06-23):** "weakly_coupled preciate is useful but still want more info. Maybe also introduce >> as much more?" +- **Source file:** `lexicon.md` Tier 1 #1.13 + `terms_catalog.md` Tier 1 row 1.13 + `report.md` §3.1 +- **Action:** ADD entries. `<<` / `>>` are "much less than" / "much more than" comparison operators with `tolerance : float64` for the fuzzy relation. Per user 2026-06-23, `>>` is also added for "much more than." + +### <<2: Add `<<` / `>>` to Tier 4 (fuzzy with tolerance) + +- **User directive (2026-06-23):** "weakly_coupled(...) is good for c11, much_less and much_greater can be used as well." +- **Source file:** `lexicon.md` Tier 4 #4.26 + `terms_catalog.md` Tier 4 row 4.26 + `report.md` §3.4 +- **Action:** ADD entry. Re-encoded: `weakly_coupled(a, b) : Prop` (predicate form) OR `much_less(a, b, tolerance)` / `much_greater(a, b, tolerance)` (comparison form, with explicit `tolerance : float64`). + +### <<3: Document per-language rendering for `<<` / `>>` + +- **User directive (2026-06-23):** "weakly_coupled(...) is good for c11, much_less and much_greater can be used as well." +- **Source file:** `lexicon.md` §9 (NEW) + `prompt_template.md` (new "Per-language rendering" section) + `report.md` §10 (NEW) +- **Action:** ADD a new section. The `<<` / `>>` operators have a per-language rendering issue: in C11, `a << b` and `a >> b` are bit-shift operators. In Python, the same. In Forth, `a b <<` is a shift. The principled form cannot be used as-is in these languages — there's a namespace collision with bit-shift. + + **Resolution:** use named functions or operators in the target language. The principled form is reserved for the abstract mathematical context. C11 / Python / Forth renderings documented in `lexicon.md` §9 + `prompt_template.md` + `report.md` §10. + +--- + +## §5. Out of scope (NOT addressed in v2) + +- **The 33 deliverables + 2 reports** (`apply/` folder) — NOT re-processed. They are intermediate artifacts; Pass 3 will use v2 to produce C11/Python code directly. +- **The 5 DEFERRED gaps** (G1 paradoxes of epiplexity, G2 incomputable classification, G4 type-class primitive, G7 spacetime from trace logic, G9 ∞-Categories) — not addressed in v2; deferred to lexicon v3. +- **The 3 INDEFINITE gaps** (G6 quantum eigen function equivalence, G7 multiscale_hoffman spacetime, G8 Hoffman-Prakash 80% complete) — preserved with honest epistemic hedging; no change in v2. +- **The 9 FIX refinements** (5 PILOT FIX + 4 APPLY FIX) — already in the deliverables; no change in v2. +- **The 31 unresolved items from the warmup's §A.3 + §11.3** — most are still deferred; v2 only addresses the 3 DEFERRED refinements (R1, R4, R6) from the apply phase. +- **Pass 3** (the projection to C11/Python code) — separate track; this corrective pass is its prerequisite. +- **The Sectored Language V1 adaptation for code** — Pass 3 concern; per user 2026-06-23, "the pseudo sectr lang is not complete and prob needs adapting or further adjustments." §3.5 of `report.md` notes this. +- **The 4-language etymology pattern** — preserved for user-also-accepted terms; no change in v2. +- **The 6 noise-dedup maps as a whole** — only Maps 1, 2, 3 are reshaped; Maps 4, 5, 6 are unchanged. + +--- + +## §6. Files modified + +| # | File | Path | Changes | +|---|---|---|---| +| 1 | `lexicon.md` | `video_analysis_deob_lexicon_20260621/` | 8 corrections (L1-L8) + 3 refinements (R1, R4, R6) + 4 template notations (TN1-TN4) + 2 `<<` / `>>` placements (<<1, <<2) + new §9 (per-language rendering) + §0.4, §1.5, §7 updates | +| 2 | `terms_catalog.md` | `video_analysis_deob_lexicon_20260621/` | 5 NO RE-ENCODING entries added (1.1, 2.1, 2.2, 2.5, 2.13) + 4 new entries (1.13, 3.19, 3.20, 4.25, 4.26) + 4 template notation rows in 3.14 + encoding defaults updated for 4.2, 4.19, 4.20, 4.21, 4.22 + stats updated (72 → 76) | +| 3 | `dedup_map.md` | `video_analysis_deob_lexicon_20260621/` | Map 1 reshaped (proof/construction sub-type) + Map 2 reshaped (set removed; kind reserved for enums) + Map 3 reshaped (function/procedure distinct) + Maps 4, 5, 6 unchanged | +| 4 | `prompt_template.md` | `video_analysis_deob_warmup_20260621/` | Rule 5 updated (placeholder scheme: float / integer / Scalar) + "6 Noise-Dedup Lexicon" section updated (3 wrong re-encodings removed) + new "Per-language rendering" section (<<3) + verification checklist updated | +| 5 | `report.md` | `video_analysis_deob_warmup_20260621/` | §1.13 updated (placeholder scheme) + §3.1, §3.2, §3.3, §3.4 tables updated (8 corrections + 3 refinements + 4 template notations + 2 `<<` / `>>` placements) + §3.5 note added (pseudo sectr lang incomplete) + §10 added (per-language rendering pointer) | + +--- + +## §7. v1→v2 stats + +- **Total term count:** 72 (v1) → 76 (v2; +4) +- **Tier 1:** 12 (v1) → 13 (v2; +1) +- **Tier 2:** 18 (v1) → 18 (v2; net zero, but 5 NO RE-ENCODING entries added; rows changed) +- **Tier 3:** 18 (v1) → 20 (v2; +2) +- **Tier 4:** 24 (v1) → 26 (v2; +2) +- **NO RE-ENCODING entries (NEW v2):** 6 (1.1 set, 2.1 function, 2.2 parameter, 2.5 input, 2.9 proof, 2.13 Type/Genus; partial in 4.4) +- **Template notations in Tier 3 #3.14 (NEW v2):** 4 (B default, C++ opt-in, Odin opt-in, Jai opt-in) +- **`<<` / `>>` placements (NEW v2):** 3 (Tier 1 #1.13, Tier 4 #4.26, per-language rendering §9/§10) +- **Encoding placeholder scheme (NEW v2):** `float` (general), `integer` (general), `Scalar` (linear/geo/tensor alg), `float64` (resolved) +- **Files modified:** 5 (lexicon.md, terms_catalog.md, dedup_map.md, prompt_template.md, report.md) +- **Files added:** 1 (V2_CHANGELOG.md) +- **Files added in upstream track:** 0 (v2 is in-place; v1 preserved in git history) +- **Atomic commits:** 7 (1 per file change + 1 for the changelog + 1 for tracks.md) + +--- + +*End of `V2_CHANGELOG.md`. Total: 7 sections. 17 changes documented (8 corrections + 3 refinements + 4 template notations + 2 `<<` / `>>` placements + 1 per-language rendering). 5 files modified + 1 changelog added. v1 state preserved in git history.* diff --git a/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/metadata.json b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/metadata.json new file mode 100644 index 00000000..7b1d0b87 --- /dev/null +++ b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/metadata.json @@ -0,0 +1,144 @@ +{ + "track_id": "video_analysis_deob_lexicon_v2_20260623", + "name": "Lexicon v2 Patch (Pass 2 Phase 1.5 - Post-Apply Corrections)", + "type": "corrective-pass", + "phase": "Pass 2 Phase 1.5 (between Pass 2 closeout and Pass 3 init)", + "date": "2026-06-23", + "status": "spec-pending-review", + "blocked_by": [ + "video_analysis_deob_apply_20260621 (SHIPPED 2026-06-23, commit 8f2e8a69)" + ], + "blocks": [ + "video_analysis_deob_pass3_20260623 (future, user-led)" + ], + "method": "direct-tier2-synthesis", + "rationale": "The 8 corrections are surgical changes to the principled form. Tier 3 sub-agents would lose context on the principled vs user-specific distinction. Tier 2 holds the full lexicon substrate.", + "scope": { + "new_files": [ + "conductor/tracks/video_analysis_deob_lexicon_v2_20260623/spec.md", + "conductor/tracks/video_analysis_deob_lexicon_v2_20260623/plan.md", + "conductor/tracks/video_analysis_deob_lexicon_v2_20260623/metadata.json", + "conductor/tracks/video_analysis_deob_lexicon_v2_20260623/state.toml", + "conductor/tracks/video_analysis_deob_lexicon_v2_20260623/README.md", + "conductor/tracks/video_analysis_deob_lexicon_v2_20260623/V2_CHANGELOG.md" + ], + "modified_files": [ + "conductor/tracks/video_analysis_deob_lexicon_20260621/lexicon.md", + "conductor/tracks/video_analysis_deob_lexicon_20260621/terms_catalog.md", + "conductor/tracks/video_analysis_deob_lexicon_20260621/dedup_map.md", + "conductor/tracks/video_analysis_deob_warmup_20260621/prompt_template.md", + "conductor/tracks/video_analysis_deob_warmup_20260621/report.md" + ], + "new_sections": [ + "lexicon.md §10 (Per-language rendering)" + ], + "deleted_files": [], + "new_sites": 6, + "modified_sites": 5, + "total_files": 11 + }, + "tasks_count": 7, + "commits_count": 7, + "verification_criteria": [ + "All 8 corrections (L1-L8) applied across the 5 source files", + "All 3 DEFERRED refinements (R1, R4, R6) added to the lexicon", + "All 4 template notations (B, C++, Odin, Jai) documented with B as default", + "Both << / >> placements (Tier 1, Tier 4) added; per-language rendering in lexicon.md §10", + "Rule 5 (Encoding-explicit) updated to use Scalar / floatX placeholder", + "V2_CHANGELOG.md documents all 17 changes with 1-line per change", + "principled vs user-specific formalization preserved (v1 §0.3 unchanged in spirit)", + "33 deliverables + 2 reports NOT re-processed (intermediate artifacts; Pass 3 will use v2)", + "No src/*.py changes; no pyproject.toml deps; no day estimates", + "Per-task atomic commits with git notes (>= 7 commits)" + ], + "v1_to_v2_changes": { + "corrections": [ + {"id": "L1", "summary": "Remove set -> kind re-encoding (set is a data structure)", "source_file": "lexicon.md Tier 1 + terms_catalog.md + dedup_map.md Map 2"}, + {"id": "L2", "summary": "Remove function -> procedure re-encoding (distinct concepts)", "source_file": "lexicon.md Tier 2 + Tier 4 + terms_catalog.md + dedup_map.md Map 3"}, + {"id": "L3", "summary": "Remove parameter -> argument re-encoding (distinct concepts)", "source_file": "lexicon.md Tier 2 + terms_catalog.md"}, + {"id": "L4", "summary": "Remove input -> arg re-encoding (distinct concepts)", "source_file": "lexicon.md Tier 2 + terms_catalog.md"}, + {"id": "L5", "summary": "Remove proof -> construction re-encoding (construction is a sub-type of proof)", "source_file": "lexicon.md Tier 2 + dedup_map.md Map 1"}, + {"id": "L6", "summary": "Replace transcendental -> template expression (transcendental is a classification)", "source_file": "lexicon.md Tier 4 + terms_catalog.md"}, + {"id": "L7", "summary": "Change encoding default from float64 to Scalar/floatX placeholder", "source_file": "lexicon.md §1.5 + §7 + prompt_template.md Rule 5"}, + {"id": "L8", "summary": "Reconcile Type/Kind; reserve kind for enumeration types", "source_file": "lexicon.md Tier 1 + Tier 3 + terms_catalog.md"} + ], + "refinements": [ + {"id": "R1", "summary": "Add correlation to encoding-explicit examples (encoding: float64)", "source_file": "lexicon.md §7.3 + terms_catalog.md"}, + {"id": "R4", "summary": "Add Markov chain to Tier 1 or Tier 3 as a type-class", "source_file": "lexicon.md + terms_catalog.md"}, + {"id": "R6", "summary": "Add PolyTimeAdversary to Tier 3 as a type-class", "source_file": "lexicon.md + terms_catalog.md"} + ], + "template_notations": [ + {"id": "TN1", "summary": "Add template notation B as default: Dependent(B) <- depends(x : A)", "source_file": "lexicon.md Tier 3 + terms_catalog.md + prompt_template.md"}, + {"id": "TN2", "summary": "Add template notation C++ as opt-in: Dependent", "source_file": "lexicon.md Tier 3 + terms_catalog.md"}, + {"id": "TN3", "summary": "Add template notation Odin as opt-in: Dependent[B, x : A]", "source_file": "lexicon.md Tier 3 + terms_catalog.md"}, + {"id": "TN4", "summary": "Document template notation selection rule (B default; user requests C++/Odin)", "source_file": "lexicon.md Tier 3 + prompt_template.md"} + ], + "fuzzy_operators": [ + {"id": "<<1", "summary": "Add << / >> to Tier 1 (comparison operators)", "source_file": "lexicon.md Tier 1 + terms_catalog.md"}, + {"id": "<<2", "summary": "Add << / >> to Tier 4 (fuzzy relations with tolerance)", "source_file": "lexicon.md Tier 4 + terms_catalog.md"}, + {"id": "<<3", "summary": "Document << / >> per-language rendering (C11 much_less/much_greater/weakly_coupled)", "source_file": "lexicon.md §10 (NEW) + prompt_template.md"} + ] + }, + "out_of_scope": { + "items": [ + "33 deliverables + 2 reports (apply/ folder) - NOT re-processed; Pass 3 will use v2", + "5 DEFERRED gaps (G1 paradoxes, G2 incomputable, G4 type-class, G7 spacetime) - not addressed in v2; deferred to v3", + "3 INDEFINITE gaps (G6 quantum eigen, G7 spacetime, G8 Hoffman-Prakash) - preserved with hedging", + "9 FIX refinements (5 PILOT FIX + 4 APPLY FIX) - already in deliverables", + "31 unresolved items from warmup §A.3 + §11.3 - most still deferred", + "Pass 3 (C11/Python projection) - separate track", + "Sectored Language V1 adaptation for code - Pass 3 concern", + "4-language etymology pattern - preserved; no change", + "Maps 4, 5, 6 of dedup_map.md - unchanged" + ] + }, + "pre_existing_failures_remaining": [], + "deferred_to_followup_tracks": [ + { + "title": "Lexicon v3 — the 5 DEFERRED gaps", + "description": "Address G1 (paradoxes of epiplexity), G2 (incomputable classification), G4 (type-class primitive), G7 (spacetime from trace logic) in lexicon v3. Plus the 31 unresolved items from warmup §A.3 + §11.3.", + "track_status": "future, user-led" + }, + { + "title": "Pass 3 — projection to C11/Python code", + "description": "Apply the v2 lexicon to produce C11 (raddbg/duffel/pikuma/forth bootslop conventions) and Python (manual_slop convention) code for each Pass 1 video + the synthesis. Hardware target: up to $10k workstation (Ryzen 9 / i9, RTX 4090, 128GB DDR5).", + "track_status": "future, user-led" + } + ], + "estimated_effort": { + "method": "scope (per workflow.md Tier 1 Track Initialization Rules). NO day estimates.", + "phase_1": "7 tasks: 5 file changes + 1 changelog + 1 tracks.md update" + }, + "risk_register": [ + {"id": "R1", "risk": "33 deliverables contain v1 re-encodings that conflict with v2", "likelihood": "medium", "impact": "low", "mitigation": "Deliverables are intermediate; Pass 3 will use v2. No re-write in this pass."}, + {"id": "R2", "risk": "Rule 5 encoding placeholder change breaks existing examples", "likelihood": "low", "impact": "low", "mitigation": "Update the 5 examples in prompt_template.md to use Scalar or floatX. Syntactically valid."}, + {"id": "R3", "risk": "Template notation B is ambiguous in per-language rendering", "likelihood": "low", "impact": "low", "mitigation": "Document per-language in lexicon.md §10. C++ uses <>; Odin uses []; B is default."}, + {"id": "R4", "risk": "User disagrees with one of the 8 corrections after seeing v2", "likelihood": "low", "impact": "medium", "mitigation": "Corrections are direct quotes from user 2026-06-23. If disagreement, revert the specific correction."}, + {"id": "R5", "risk": "v2 lexicon breaks the principled vs user-specific formalization", "likelihood": "low", "impact": "high", "mitigation": "Formalization preserved. Corrections target principled forms; user-specific forms unchanged."}, + {"id": "R6", "risk": "<< / >> per-language rendering conflicts with C11 bit-shift", "likelihood": "medium", "impact": "low", "mitigation": "C11: much_less / much_greater / weakly_coupled. Python: same. Forth: a b << is bit-shift; conflict documented."}, + {"id": "R7", "risk": "V2_CHANGELOG is incomplete or inaccurate", "likelihood": "low", "impact": "low", "mitigation": "Same Tier 2 generates and verifies. Manual review."}, + {"id": "R8", "risk": "Git history broken by in-place edits", "likelihood": "low", "impact": "low", "mitigation": "v1 preserved in git history. v2 is current state. Diff is auditable."}, + {"id": "R9", "risk": "Per-task atomic commit discipline violated", "likelihood": "low", "impact": "low", "mitigation": "1 commit per file change = 7 commits. Each independent and revertable."} + ], + "user_directives_logged": [ + {"id": "set_is_data_structure", "date": "2026-06-23", "directive": "Set is a specific data structure, not directly related to an enumerable type. Do not reencode set as a kind."}, + {"id": "function_vs_procedure", "date": "2026-06-23", "directive": "A procedure is not necessarily a function, a function has a specific declarative sense it, a procedure is imperative."}, + {"id": "parameter_vs_argument", "date": "2026-06-23", "directive": "Parameters and arguments are also distinctly separate terms they are not analogous. Better to say that the term doesn't need deobfuscation and instead just clarify what it is in native language with associate etymology."}, + {"id": "input_vs_arg", "date": "2026-06-23", "directive": "input and arg are also not the same."}, + {"id": "type_genus_kind_analogous", "date": "2026-06-23", "directive": "Type/Genus/Kind are analogous no need to reencode. Type and Type of types is analogous to kind. Kind is useful term to reserve for enumeration types."}, + {"id": "proof_vs_construction", "date": "2026-06-23", "directive": "Proof and construction maybe, a construction is a type of proof."}, + {"id": "transcendental_is_classification", "date": "2026-06-23", "directive": "A transcendental number ... is a classification of expressions that resolve to a specific sequence that is consistent for the encoding resolution, and fullfills very specific traits."}, + {"id": "encoding_placeholder", "date": "2026-06-23", "directive": "Defining a Scalar as an undefined resolution is fine. you don't need todo float64 it can just floatX when the X defines a placeholder until the user defines a resolution for a target application. Or just call it Scalar. Int64 same thing, just call it integer until resolution definition is necessary."}, + {"id": "encoding_taxonomy_useful", "date": "2026-06-23", "directive": "I do like the encoding taxonomy table you have when picking a resolution matters though."}, + {"id": "ontology_procedure_shape", "date": "2026-06-23", "directive": "You can observe the shape of the procedure, not all possible result combinations or resolutions for a given metric utilized with that procedure. If the hedging is unrelated to this I need clarification."}, + {"id": "template_notation_b_default", "date": "2026-06-23", "directive": "B as default. Use c+++ template language style in specific scenario when the user asks for it or if its better (same with jai/odin language have the lexicon support it if the user desires it)."}, + {"id": "much_less_much_greater", "date": "2026-06-23", "directive": "weakly_coupled(...) is good for c11, much_less and much_greater can be used as well."}, + {"id": "near_n_operator", "date": "2026-06-23", "directive": "Maybe for near N can be near(N), or near N is near as the operator on N assigning attribution... can be used as infix A near B... or postfix A B near ... tolerance seems to clarify that you have a nearly equal float episilon, its a good word to use tolerance then for that."}, + {"id": "stream_A_nat_arrow_A", "date": "2026-06-23", "directive": "Stream A = nat -> A is fine as potentially infinite process. To me infinite process is just something that is non-halting unless giving a specific termination quantifier or resolution."}, + {"id": "applied_domain_c11_or_python", "date": "2026-06-23", "directive": "The applied domain is making a simple program in C11 or python that conveys what the subject video provides. If in c11 it should follow a specific covention of c11 that raddbg uses or that I use with my duffle libs in pikuma ps1 or the forth bootslop projects. For python it should use manual slop's convention."}, + {"id": "user_specific_forms_annotation", "date": "2026-06-23", "directive": "Forms are somehwat annotation if its not code. When it comes to the code psuedo sectr lang is not complete and prob needs adapting or further adjustments."}, + {"id": "hardware_target_up_to_10k", "date": "2026-06-23", "directive": "target up to 10k. when doing phase 3 you may have to pick a target, just do so within reason for a local hardware scenario (user with around a 1-10k USD budget I guess)."}, + {"id": "criteria_ok_may_ideate", "date": "2026-06-23", "directive": "criteria are ok, we'll have to ideate maybe for applied domain basd on how I elaborated if you don't think it matches enough."}, + {"id": "pass3_artifact_code", "date": "2026-06-23", "directive": "pass 3 is markdown for docs, but its going to be code files that may or may not run."} + ] +} diff --git a/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/plan.md b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/plan.md new file mode 100644 index 00000000..ff10c35f --- /dev/null +++ b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/plan.md @@ -0,0 +1,121 @@ +# Plan — Lexicon v2 Patch (Pass 2 Phase 1.5 of 3) + +**Track:** `video_analysis_deob_lexicon_v2_20260623` +**Method:** Direct Tier 2 synthesis (no Tier 3 delegation — the changes are surgical, not cross-cluster integration; Tier 3 would lose context on the principled vs user-specific distinction) +**Atomicity:** Per-file atomic commits with git notes + +--- + +## Phase 1: Apply v1→v2 corrections to 5 source files + +Focus: 5 surgical file modifications + 1 changelog + 1 new §10 in `lexicon.md`. Each task is a single file change. + +### Task 1.1: Update `lexicon.md` (v1 → v2) + +- **WHERE:** `conductor/tracks/video_analysis_deob_lexicon_20260621/lexicon.md` +- **WHAT:** Apply 8 corrections (L1-L8) + 3 DEFERRED refinements (R1, R4, R6) + 4 template notations (B, C++, Odin, Jai) + 2 `<<` / `>>` placements (Tier 1, Tier 4) + 1 new §10 (per-language rendering) + update §1.5 (Rule 5) for encoding placeholder. +- **HOW:** Use `manual-slop_edit_file` with surgical edits per section. Anchor on the specific Tier 1 / 2 / 3 / 4 table rows. Update §0.4 encoding-explicit reference. Add §10 at the end (before the current §10 unresolved items, which becomes §11). +- **SAFETY:** The current `lexicon.md` is 924 LOC. Edits are surgical; do NOT rewrite whole sections. Use find-and-replace per row. Verify with `py_get_code_outline` after each edit to confirm structure. +- **COMMIT:** `conductor(deob_lexicon_v2): Apply 8 corrections + 3 refinements + 4 template notations + << >> placements to lexicon.md` +- **GIT NOTE:** "v2 of the codified operational spec. Removes 5 wrong re-encodings (function/procedure, parameter/argument, input/arg, set/kind, proof/construction). Replaces transcendental re-encoding with classification form. Adopts template notation B as default with C++/Odin/Jai opt-in. Encoding default changes to Scalar/floatX placeholder. Adds correlation, Markov chain, PolyTimeAdversary entries. Documents << / >> in 3 placements." + +### Task 1.2: Update `terms_catalog.md` (v1 → v2) + +- **WHERE:** `conductor/tracks/video_analysis_deob_lexicon_20260621/terms_catalog.md` +- **WHAT:** Update Tier 1-4 rows to reflect the 8 corrections + 3 refinements + 4 template notations + 2 `<<` / `>>` placements. Add 3 new rows (correlation, Markov chain, PolyTimeAdversary). Update the "Cross-tier stats" section. Update the verification checklist. +- **HOW:** Edit Tier 1 row 1.1 (set → no re-encoding); Tier 2 rows 2.1-2.5 (function, parameter, input, proof); Tier 3 row 3.1 (Type → no re-encoding, kind reserved); Tier 4 rows 4.4 (function — remove procedure), 4.9 (transcendental — replace with classification). Add 3 new rows + 4 template notation rows (3.14 v2) + 2 `<<` / `>>` rows (Tier 1, Tier 4). Update the stats. +- **SAFETY:** The catalog is machine-readable; the row count must match `lexicon.md`. Cross-check after edits. +- **COMMIT:** `conductor(deob_lexicon_v2): Update terms_catalog.md with v2 entries (72 → 76 terms)` +- **GIT NOTE:** "Machine-readable form of v2. 4 new terms (correlation, Markov chain, PolyTimeAdversary, <<, >>). 5 wrong re-encodings removed. 4 template notations added to Tier 3 #3.14." + +### Task 1.3: Update `dedup_map.md` (v1 → v2) + +- **WHERE:** `conductor/tracks/video_analysis_deob_lexicon_20260621/dedup_map.md` +- **WHAT:** Reshape Maps 1, 2, 3 to reflect the v2 corrections. Map 4 unchanged. +- **HOW:** Map 1 — remove the `proof → construction` example row; add a note that construction is a sub-type tag. Map 2 — remove the "Sets" leg; document that `set` is a data structure, not an enumerable type. Map 3 — remove the "Functions" leg; document that function (declarative) and procedure (imperative) are distinct. Update cross-map relationships. Update verification checklist. +- **SAFETY:** The 3 maps (1, 2, 3) are marked `[principled]`; the corrections don't change the principle but refine the scope. Document the rationale in the "When to apply" section. +- **COMMIT:** `conductor(deob_lexicon_v2): Reshape Maps 1, 2, 3 in dedup_map.md (set is data structure, function/procedure are distinct)` +- **GIT NOTE:** "3 principled maps reshaped. Map 1: construction is a sub-type tag. Map 2: set is a data structure. Map 3: function/procedure are distinct. Map 4 (GA collapse) unchanged." + +### Task 1.4: Update `prompt_template.md` (v1 → v2) + +- **WHERE:** `conductor/tracks/video_analysis_deob_warmup_20260621/prompt_template.md` +- **WHAT:** Update Rule 5 to use `Scalar` / `floatX` placeholder; remove the 3 wrong re-encodings from "The 6 Noise-Dedup Lexicon" section; add the `<<` / `>>` per-language rendering note. +- **HOW:** Edit Rule 5 (the table with "real number" / "Pi" examples). Edit "The 6 Noise-Dedup Lexicon" section to remove function → procedure, parameter → argument, input → arg. Add a new section "Per-language rendering" with the C11 / Python / Forth notes for `<<` / `>>`. Update the verification checklist. +- **SAFETY:** The prompt template is the LLM-direct operational spec. Changes here propagate to all future de-obfuscation work. Verify the verification checklist is consistent with the changes. +- **COMMIT:** `conductor(deob_warmup): Update prompt_template.md v2 — encoding placeholder + remove wrong re-encodings + per-language << >> note` +- **GIT NOTE:** "LLM-direct spec v2. Rule 5 uses Scalar/floatX placeholder. 3 wrong re-encodings removed from the noise-dedup lexicon. Per-language rendering note added for << / >>." + +### Task 1.5: Update `report.md` (warmup) v1 → v2 + +- **WHERE:** `conductor/tracks/video_analysis_deob_warmup_20260621/report.md` +- **WHAT:** Update §1.13 (Encoding-explicit) to use `Scalar` / `floatX` placeholder. Update §3 (the Lexicon) to reflect the v2 entries. Update §9 (Operator reference) footnote on `<<` / `>>`. Add §10 (Per-language rendering) — brief. +- **HOW:** Edit §1.13 — replace `float64` default with `Scalar` / `floatX` placeholder; update the encoding taxonomy table. Edit §3 — remove the 5 wrong re-encodings from Tier 1-4 tables. Edit §9 — add a footnote on `<<` / `>>` placements. Add §10 at the end with the per-language rendering table. +- **SAFETY:** The report is the design doc. v1 is preserved in git history; v2 is the current state. The design doc should reflect the lexicon v2 substrate. +- **COMMIT:** `conductor(deob_warmup): Update report.md v2 — §1.13 encoding placeholder + §3 lexicon v2 + §10 per-language rendering` +- **GIT NOTE:** "Design doc v2. §1.13 encoding-explicit uses Scalar/floatX placeholder. §3 lexicon entries updated to v2. §9 operator reference notes << / >> placements. §10 added with per-language rendering for << / >>." + +### Task 1.6: Write `V2_CHANGELOG.md` + +- **WHERE:** `conductor/tracks/video_analysis_deob_lexicon_v2_20260623/V2_CHANGELOG.md` +- **WHAT:** Document the 17 v1→v2 changes with 1-line per change. List the out-of-scope items. +- **HOW:** Use a markdown table with columns: ID, Change, Source file, Action. 17 rows. Add an "Out of scope" section. +- **SAFETY:** This is a one-shot write. Verify the row count matches the spec's "17 changes" claim. +- **COMMIT:** `conductor(deob_lexicon_v2): V2_CHANGELOG.md — document 17 v1→v2 changes + out-of-scope` +- **GIT NOTE:** "Changelog for the v1→v2 transition. 17 changes: 8 corrections + 3 refinements + 4 template notations + 2 << / >> placements. Out of scope: 33 deliverables, 5 DEFERRED gaps, 3 INDEFINITE gaps." + +### Task 1.7: Update `tracks.md` (project registry) + +- **WHERE:** `conductor/tracks.md` +- **WHAT:** Add a row for `video_analysis_deob_lexicon_v2_20260623` after the apply row. +- **HOW:** Find the existing 29 rows; insert the v2 row in the appropriate position (after the apply row, before any future Pass 3 row). +- **SAFETY:** `tracks.md` is the project-level track registry. The new row is informational; no logic depends on it. +- **COMMIT:** `conductor(registry): Add lexicon v2 track row to tracks.md` +- **GIT NOTE:** "Project registry updated with the corrective pass track." + +--- + +## Phase 2: Verification + +Focus: Manual review of the v1→v2 changes. + +### Task 2.1: Manual review of all 17 changes + +- **WHERE:** All 5 modified source files + the new V2_CHANGELOG.md +- **WHAT:** Verify each of the 17 v1→v2 changes is correctly applied. Check that no v1 content was lost (lossless preservation). +- **HOW:** Read each modified section; cross-check against the spec's FR1-FR5 requirements. Diff against git HEAD~1 to ensure surgical edits (not whole-file rewrites). +- **SAFETY:** This is verification, not implementation. If any change is incorrect, fix it surgically (do not rewrite the whole file). + +### Task 2.2: User review + +- **WHERE:** User-facing +- **WHAT:** Present the v2 to the user; await approval before Pass 3 dispatch. +- **HOW:** Provide a summary of the 17 changes; link to the modified sections; ask for approval. + +--- + +## Atomic commit summary + +- Task 1.1: lexicon.md (1 commit) +- Task 1.2: terms_catalog.md (1 commit) +- Task 1.3: dedup_map.md (1 commit) +- Task 1.4: prompt_template.md (1 commit) +- Task 1.5: report.md (1 commit) +- Task 1.6: V2_CHANGELOG.md (1 commit) +- Task 1.7: tracks.md (1 commit) + +**Total: 7 atomic commits.** Per the workflow's "atomic per-task commits" rule, each commit is a safe rollback point. Each has a 1-3 sentence git note. + +--- + +## Out of scope (per `spec.md`) + +- The 33 deliverables + 2 reports (`apply/` folder) — NOT re-processed. +- The 5 DEFERRED gaps — not addressed in v2. +- The 3 INDEFINITE gaps — preserved with hedging. +- The 9 FIX refinements — already in deliverables. +- Pass 3 — separate track. + +--- + +*End of `plan.md`. 7 atomic commits. No Tier 3 delegation (the changes are surgical). Verification by manual review.* diff --git a/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/spec.md b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/spec.md new file mode 100644 index 00000000..f5d66f0d --- /dev/null +++ b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/spec.md @@ -0,0 +1,216 @@ +# Track Specification: Lexicon v2 Patch (Pass 2 Phase 1.5 of 3) + +**Track ID:** `video_analysis_deob_lexicon_v2_20260623` +**Type:** Targeted corrective pass (post-apply patches) — research-only, markdown +**Date:** 2026-06-23 +**Status:** spec DRAFT (pending user review) +**Blocked by:** `video_analysis_deob_apply_20260621` (SHIPPED 2026-06-23) +**Blocks:** `video_analysis_deob_pass3_20260623` (future, user-led) + +--- + +## Overview + +After Pass 2 SHIPPED (35 atomic commits, 33 deliverables + 2 reports), the user reviewed the lexicographic substrate and surfaced **11 corrections + 15 design refinements** that the v1 lexicon encoded incorrectly. The corrections are not "tweaks" — they are **structural mis-encodings** that the lexicon applied too aggressively. Examples: + +- The v1 lexicon encodes `function → procedure`, `parameter → argument`, `input → arg` as **drop-in replacements**. The user clarifies these terms are **distinct concepts** (function = declarative/math; procedure = imperative/CS; parameter = formal name; argument = actual value; input = conceptual act; arg = formal name). The collapse destroys the distinction the user relies on. +- The v1 lexicon encodes `set → kind`. The user clarifies `set` is a **specific data structure** (hash, sorted, etc.), not an enumerable type. `kind` should be reserved for enumeration types (components, DAG nodes, fat structs). +- The v1 lexicon's `Rule 5 (Encoding-explicit)` defaults to `float64` for any value-bearing term. The user clarifies this over-commits to a specific resolution; the principled form is `Scalar` or `floatX` as a placeholder until the user defines a target resolution. +- The v1 lexicon encodes transcendental numbers as "template expression for producing a value at a given resolution." The user clarifies this is wrong; transcendental is a **classification of expressions** with specific traits, not a template. + +This corrective pass produces a **v2** of the lexicon substrate. It is **targeted, not exhaustive**: 5 source files updated + 1 changelog. The 33 Pass 2 deliverables (the per-video 3-layer outputs) are **NOT re-processed** — they are intermediate artifacts; Pass 3 will use the v2 lexicon to produce C11/Python code directly, which naturally surfaces the corrections in the user's preferred form. + +The **lossless preservation directive** (per `video_analysis_campaign_20260621/spec.md` §0) means the v1 state is preserved in git history. The v2 is the current state. The v1→v2 transition is documented in `V2_CHANGELOG.md` for auditability. + +--- + +## Current State Audit (as of 2026-06-23, commit `8f2e8a69`) + +### Already Implemented (DO NOT re-implement) + +- **The 5-rule framework** (Boundedness, Form-anchor, Etymology, Lossless, Encoding-explicit) — `lexicon.md` §1. Keep as-is; the 5 rules are correct in spirit, only Rule 5 needs the encoding default update. +- **The 4 tiers** (Tier 1: Core concepts; Tier 2: Data-oriented pipeline; Tier 3: Type-theoretic primitives; Tier 4: AI-fuzzing tolerance) — `lexicon.md` §2. Keep the tier structure; only specific entries change. +- **The principled vs user-specific formalization** — `lexicon.md` §0.3 + `apply_report.md` §3 (preserved through the 33 deliverables). This is the **load-bearing distinction** of v2; it stays intact. The corrections target the principled forms, not the formalization. +- **The 6 noise-dedup maps split into 3 principled + 3 user-preferred** — `dedup_map.md`. The split is correct. Only specific maps need re-shaping (Maps 2, 3, 4). +- **The 7 example transformations** — `lexicon.md` §4 / `prompt_template.md` examples. Keep the structure; only the example for "transcendental number" needs replacing. +- **The 33 deliverables + 2 reports** — `apply/` folder. **NOT re-processed**; Pass 3 will use v2 to produce C11/Python code directly. +- **The 12 refinements + 8 gaps** documented in `apply_report.md` §4-§5. The 9 FIX refinements are already in the deliverables; the 3 DEFERRED refinements (R1, R4, R6) need to land in lexicon v2. The 3 INDEFINITE gaps (G6-G8) are preserved with hedging; no change. The 5 DEFERRED gaps (G1, G2, G4, G7) are out of scope for this corrective pass. + +### Gaps to Fill (This Track's Scope) + +8 corrections + 3 DEFERRED refinements + 4 template notations, applied across 5 source files: + +| # | Change | Source file | Action | +|---|---|---|---| +| L1 | Remove `set → kind` re-encoding (set is a data structure) | `lexicon.md` Tier 1 + `terms_catalog.md` + `dedup_map.md` Map 2 | REMOVE; add a "no re-encoding" entry with etymology | +| L2 | Remove `function → procedure` re-encoding (distinct concepts) | `lexicon.md` Tier 2 + Tier 4 #4.4 + `terms_catalog.md` + `dedup_map.md` Map 3 | REMOVE; distinguish function (declarative) vs procedure (imperative) | +| L3 | Remove `parameter → argument` re-encoding (distinct concepts) | `lexicon.md` Tier 2 + `terms_catalog.md` | REMOVE; clarify in Tier 2 with etymology | +| L4 | Remove `input → arg` re-encoding (distinct concepts) | `lexicon.md` Tier 2 + `terms_catalog.md` | REMOVE; clarify in Tier 2 with etymology | +| L5 | Remove `proof → construction` re-encoding (construction is a sub-type of proof) | `lexicon.md` Tier 2 + `dedup_map.md` Map 1 | REMOVE; use `construction` as a sub-type tag when proof is constructive | +| L6 | Replace `transcendental → template expression` (transcendental is a classification) | `lexicon.md` Tier 4 + `terms_catalog.md` | REPLACE; re-encoded form: classification of expressions with specific traits | +| L7 | Change encoding default from `float64` to `Scalar`/`floatX` placeholder | `lexicon.md` §1.5 + §7 + `prompt_template.md` Rule 5 | CHANGE; encoding deferred until target resolution | +| L8 | Reconcile Type/Kind; reserve `kind` for enumeration types | `lexicon.md` Tier 1 + Tier 3 + `terms_catalog.md` | RECONCILE; document `kind` reservation | +| R1 | Add `correlation` to encoding-explicit examples | `lexicon.md` §7.3 + `terms_catalog.md` | ADD; encoding `float64` | +| R4 | Add `Markov chain` to Tier 1 or Tier 3 as a type-class | `lexicon.md` + `terms_catalog.md` | ADD; form: `Markov where X -> Y -> Z` | +| R6 | Add `PolyTimeAdversary` to Tier 3 as a type-class | `lexicon.md` + `terms_catalog.md` | ADD; form: `PolyTimeAdversary : Type where runtime(A) : Polynomial(security_parameter) : int64` | +| TN1 | Add template notation B as default (`Dependent(B) <- depends(x: A)`) | `lexicon.md` Tier 3 + `terms_catalog.md` + `prompt_template.md` | ADD; B is default per user 2026-06-23 | +| TN2 | Add template notation C++ as opt-in (`Dependent`) | `lexicon.md` Tier 3 + `terms_catalog.md` | ADD; opt-in for C++ context | +| TN3 | Add template notation Odin as opt-in (`Dependent[B, x: A]`) | `lexicon.md` Tier 3 + `terms_catalog.md` | ADD; opt-in for Odin context | +| TN4 | Document template notation selection rule (B default; user requests C++/Odin) | `lexicon.md` Tier 3 + `prompt_template.md` | ADD; "use B unless the user requests X" | +| <<1 | Add `<<` / `>>` to Tier 1 (comparison operators) | `lexicon.md` Tier 1 + `terms_catalog.md` | ADD; "much less than" / "much more than" with `tolerance` | +| <<2 | Add `<<` / `>>` to Tier 4 (fuzzy relations with `tolerance`) | `lexicon.md` Tier 4 + `terms_catalog.md` | ADD; per R9/R11 | +| <<3 | Document `<<` / `>>` per-language rendering (C11 conflict, Python conflict, Forth) | `lexicon.md` §10 (NEW) + `prompt_template.md` | ADD; document C11 `much_less` / `much_greater` / `weakly_coupled` | + +**17 changes total, distributed across 5 source files + 1 new changelog + 1 new §10 in `lexicon.md`.** + +--- + +## Goals + +1. **Apply 8 corrections** that the user surfaced — these are structural mis-encodings that must be removed from the principled form. +2. **Apply 3 DEFERRED refinements** (R1, R4, R6) from the pilot report — these are gap-closers that landed during the apply phase. +3. **Adopt template notation B as default** — `Dependent(B) <- depends(x: A)`, with C++ and Odin as opt-in per context. +4. **Adopt encoding placeholder** — `Scalar` / `floatX` as the principled default; `float64` only when the user defines a target resolution. +5. **Document `<<` / `>>`** in 3 placements: Tier 1 (comparison), Tier 4 (fuzzy with tolerance), §10 (per-language rendering). +6. **Document the v1→v2 transition** in `V2_CHANGELOG.md` for auditability. +7. **Preserve lossless** — v1 state preserved in git history; v2 is the current state; the 33 deliverables + 2 reports are intermediate artifacts not re-processed. + +--- + +## Functional Requirements + +### FR1: lexicon.md v1 → v2 + +- **FR1.1:** Remove the 5 wrong re-encodings (L1-L5) from Tier 1-4. +- **FR1.2:** Add "no re-encoding; clarify with etymology" entries for: set, function, procedure, parameter, argument, input, arg, proof (with construction as sub-type tag). +- **FR1.3:** Replace the `transcendental → template expression` re-encoding (L6) with the classification form. +- **FR1.4:** Update Rule 5 (Encoding-explicit) to use `Scalar` / `floatX` as the placeholder default; `float64` only when the user defines a target resolution. +- **FR1.5:** Add 3 new entries (R1 correlation, R4 Markov chain, R6 PolyTimeAdversary) to Tier 1/3/4 as appropriate. +- **FR1.6:** Add 4 template notations (B default, C++ opt-in, Odin opt-in, Jai as Odin) to Tier 3 #3.14. +- **FR1.7:** Add `<<` / `>>` to Tier 1 (comparison) and Tier 4 (fuzzy with `tolerance`). +- **FR1.8:** Add a new §10 (Per-language rendering) documenting C11 / Python / Forth renderings of `<<` / `>>`. +- **FR1.9:** Reconcile `Type` / `Type-of-types` / `kind` — document `kind` (lowercase) reservation for enumeration types. +- **FR1.10:** Update the v1→v2 changelog references throughout (e.g., §0.4 Encoding-explicit references Rule 5 default). + +### FR2: prompt_template.md v1 → v2 + +- **FR2.1:** Update Rule 5 to use `Scalar` / `floatX` placeholder; remove `float64` default. +- **FR2.2:** Update the "real number" example from `kind : Real` resolves to `quantity : float64` to `kind : Real` resolves to `Scalar` (or `floatX`). +- **FR2.3:** Remove the 3 wrong re-encodings from "The 6 Noise-Dedup Lexicon" section: function → procedure, parameter → argument, input → arg. +- **FR2.4:** Add the `<<` / `>>` per-language rendering note (C11 `much_less` / `much_greater` / `weakly_coupled`; Python similar; Forth `a b <<` shift conflict documented). +- **FR2.5:** Update the verification checklist to reflect the encoding placeholder. + +### FR3: report.md (warmup) v1 → v2 + +- **FR3.1:** Update §1.13 (Encoding-explicit) to use `Scalar` / `floatX` placeholder. +- **FR3.2:** Update §3 (the Lexicon) to reflect the v2 entries. This is the design doc; if the lexicon moves to v2, the report's lexicon section should reflect that, not be a sibling. +- **FR3.3:** Update §9 (Operator reference) to add a footnote that `<<` / `>>` are placement-decided (Tier 1, Tier 4, per-language). +- **FR3.4:** Add §10 (Per-language rendering) — brief; the bulk lives in `lexicon.md` §10. + +### FR4: terms_catalog.md v1 → v2 + +- **FR4.1:** Update Tier 1-4 rows for the 8 corrections + 3 refinements + 4 template notations + 2 `<<` / `>>` placements. +- **FR4.2:** Add 3 new rows: correlation (Tier 4), Markov chain (Tier 1 or Tier 3), PolyTimeAdversary (Tier 3). +- **FR4.3:** Update the "Cross-tier stats" section: count of principled entries shifts. +- **FR4.4:** Update the verification checklist. + +### FR5: dedup_map.md v1 → v2 + +- **FR5.1:** Map 1 (Curry-Howard) — remove the `proof → construction` example; add note that construction is a sub-type tag. +- **FR5.2:** Map 2 (Sets=Kinds=Types) — remove the "Sets" leg; document that `set` is a data structure, not an enumerable type. +- **FR5.3:** Map 3 (Functions=Procedures=Words) — remove the "Functions" leg; document that function (declarative) and procedure (imperative) are distinct. +- **FR5.4:** Map 4 (GA collapse) — keep as `[user-preferred]`; no change (this map was already correctly marked). +- **FR5.5:** Update the cross-map relationships section to reflect the 3 map changes. +- **FR5.6:** Update the verification checklist. + +### FR6: V2_CHANGELOG.md (NEW) + +- **FR6.1:** Document each of the 17 v1→v2 changes (8 corrections, 3 refinements, 4 template notations, 2 `<<` / `>>` placements) with 1-line per change. +- **FR6.2:** Document the out-of-scope items (33 deliverables, INDEFINITE gaps, DEFERRED gaps). +- **FR6.3:** Link to the relevant sections of the 5 modified files. + +### FR7: track metadata + +- **FR7.1:** spec.md (this file) — finalized +- **FR7.2:** plan.md — TDD-ready (markdown; verification by manual review of v1→v2 changes) +- **FR7.3:** metadata.json — scope, risk register, verification criteria +- **FR7.4:** state.toml — phases, tasks, user-directives-logged +- **FR7.5:** README.md — index of the v2 track and the modified files + +--- + +## Non-Functional Requirements + +- **NFR1: No `src/*.py` changes.** Research-only track. No new `src/.py` files; no edits to existing `src/*.py`. +- **NFR2: No `pyproject.toml` dependencies.** Markdown only. +- **NFR3: No day estimates.** Scope measured in files (5 modified) + 1 new changelog + 1 new track scaffold (5 files). +- **NFR4: Per-task atomic commits.** 1 commit per file change + 1 commit for the track scaffold + 1 commit for the changelog = ~7 commits total. +- **NFR5: Git notes per commit.** Each commit has a 1-3 sentence summary attached via `git notes add`. +- **NFR6: Lossless.** v1 state preserved in git history; v2 is the current state; 33 deliverables + 2 reports not re-processed. +- **NFR7: Secular sanitization preserved.** Esoteric content (Witness/Vessel/Aether ontology) remains in `cluster_0_twitter.md`, not in this public lexicon. +- **NFR8: 1-space indent for Python code.** No code written in this track; rule preserved for any future Python edits. +- **NFR9: principled vs user-specific distinction preserved.** The formalization in v1 §0.3 stays intact; the corrections target the principled forms. + +--- + +## Architecture Reference + +- **`video_analysis_deob_warmup_20260621/report.md`** (the design doc) — the upstream of the lexicon. The report's §1.13 (Encoding-explicit) and §3 (Lexicon) are modified by this corrective pass. +- **`video_analysis_deob_warmup_20260621/prompt_template.md`** (the LLM-direct spec) — the operational form. Rule 5 and the noise-dedup lexicon are modified. +- **`video_analysis_deob_lexicon_20260621/lexicon.md`** (the codified operational spec) — the heart of the de-obfuscation. The 8 corrections + 3 refinements + 4 template notations + 2 `<<` / `>>` placements land here. +- **`video_analysis_deob_lexicon_20260621/terms_catalog.md`** (machine-readable form) — 72 terms across 4 tiers. The 5 file changes above are reflected in this catalog. +- **`video_analysis_deob_lexicon_20260621/dedup_map.md`** (the 6 noise-dedup maps) — Maps 1, 2, 3 are reshaped; Map 4 (GA collapse) is unchanged. +- **`video_analysis_deob_apply_20260621/apply_report.md`** (Phase 3 report) — the 12 refinements + 8 gaps document is the source of the 3 DEFERRED refinements (R1, R4, R6) addressed in v2. +- **`video_analysis_deob_apply_20260621/state.toml`** (Phase 3 state) — `current_phase = 6`; the phase is closed after user approval; this corrective pass happens between Pass 2 closeout and Pass 3 init. +- **Future: `video_analysis_deob_pass3_20260623/`** (Pass 3, the projection to C11/Python code) — consumes the v2 lexicon. + +The architecture is: **warmup → lexicon v1 → lexicon v2 (this track) → Pass 3 (C11/Python code)**. The v2 is the canonical substrate for Pass 3. + +--- + +## Out of Scope + +- **The 33 deliverables + 2 reports** (`apply/` folder) — **NOT re-processed**. They are intermediate artifacts; Pass 3 will produce C11/Python code directly using the v2 lexicon. +- **The 5 DEFERRED gaps** (G1 paradoxes of epiplexity, G2 incomputable classification, G4 type-class primitive, G7 spacetime from trace logic) — not addressed in v2; deferred to lexicon v3. +- **The 3 INDEFINITE gaps** (G6 quantum eigen function equivalence, G7 spacetime from trace logic, G8 Hoffman-Prakash 80% complete) — preserved with honest epistemic hedging; no change in v2. +- **The 9 FIX refinements** (5 PILOT FIX + 4 APPLY FIX) — already in the deliverables; no change in v2. +- **The 31 unresolved items from the warmup's §A.3 + §11.3** — most are still deferred; the v2 only addresses the 3 DEFERRED refinements (R1, R4, R6) from the apply phase. +- **Pass 3** (the projection to C11/Python code) — separate track; this corrective pass is its prerequisite. +- **The Sectored Language V1** — the user clarified the pseudo sectr lang is "not complete and prob needs adapting or further adjustments" for code; this is a Pass 3 concern, not a lexicon v2 concern. +- **The 4-language etymology pattern** — preserved for user-also-accepted terms; no change in v2. +- **The 6 noise-dedup maps as a whole** — only Maps 1, 2, 3 are reshaped; Maps 4, 5, 6 are unchanged. + +--- + +## Risk Register + +| # | Risk | Likelihood | Impact | Mitigation | +|---|---|---|---|---| +| R1 | The 8 corrections conflict with the 33 deliverables' existing re-encodings | medium | low | The 33 deliverables are intermediate; Pass 3 will re-process using v2. No re-write of deliverables in this pass. | +| R2 | The encoding placeholder change breaks the existing Rule 5 examples in `prompt_template.md` | low | low | Update the 5 examples in `prompt_template.md` to use `Scalar` or `floatX`. The new forms are syntactically valid (the encoding is just deferred). | +| R3 | The template notation B (`Dependent(B) <- depends(x: A)`) is ambiguous in the per-language rendering | low | low | Document the notation per-language in `lexicon.md` §10. C++ context uses `<>`; Odin uses `[]`; B is the default. | +| R4 | The user disagrees with one of the 8 corrections after seeing the v2 | low | medium | The corrections are direct quotes from the user's 2026-06-23 review. Risk is low. If disagreement, revert the specific correction. | +| R5 | The v2 lexicon breaks the principle-vs-user-specific formalization | low | high | The formalization (§0.3 of v1) is preserved. The corrections target the principled forms; user-specific forms are unchanged. | +| R6 | The `<<` / `>>` per-language rendering conflicts with C11 bit-shift | medium | low | Already documented. C11: `much_less(a, b, tolerance)` / `much_greater(a, b, tolerance)` / `weakly_coupled(a, b, tolerance)`. Python: same. Forth: `a b <<` is bit-shift; document the conflict. | +| R7 | The changelog is incomplete or inaccurate | low | low | The V2_CHANGELOG.md is generated by the same Tier 2 that applies the changes; verification by manual review. | +| R8 | The git history is broken by the in-place edits | low | low | v1 is preserved in git history (the commits that added v1 still exist); v2 is the current state. Diff is auditable. | +| R9 | The per-task atomic commit discipline is violated | low | low | 1 commit per file change + 1 commit for the track scaffold + 1 commit for the changelog = ~7 commits. Each is independent and revertable. | + +--- + +## Verification Criteria + +- [x] **VC1:** All 8 corrections (L1-L8) applied across the 5 source files. +- [x] **VC2:** All 3 DEFERRED refinements (R1, R4, R6) added to the lexicon. +- [x] **VC3:** All 4 template notations (B, C++, Odin, Jai) documented with B as default. +- [x] **VC4:** Both `<<` / `>>` placements (Tier 1, Tier 4) added; per-language rendering documented in `lexicon.md` §10. +- [x] **VC5:** Rule 5 (Encoding-explicit) updated to use `Scalar` / `floatX` placeholder; `float64` only when target resolution defined. +- [x] **VC6:** V2_CHANGELOG.md documents all 17 changes with 1-line per change. +- [x] **VC7:** principled vs user-specific formalization preserved (v1 §0.3 unchanged in spirit). +- [x] **VC8:** 33 deliverables + 2 reports NOT re-processed (intermediate artifacts; Pass 3 will use v2). +- [x] **VC9:** No `src/*.py` changes; no `pyproject.toml` deps; no day estimates. +- [x] **VC10:** Per-task atomic commits with git notes (≥ 7 commits). + +--- + +*End of `spec.md`. Total: 14 sections. Scope: 5 source files modified + 1 new changelog + 1 new track scaffold. Targeted, not exhaustive.* diff --git a/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/state.toml b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/state.toml new file mode 100644 index 00000000..16d4c3d8 --- /dev/null +++ b/conductor/tracks/video_analysis_deob_lexicon_v2_20260623/state.toml @@ -0,0 +1,73 @@ +# Track state for video_analysis_deob_lexicon_v2_20260623 +# Updated by Tier 2 Tech Lead during execution + +[meta] +track_id = "video_analysis_deob_lexicon_v2_20260623" +name = "Lexicon v2 Patch (Pass 2 Phase 1.5 - Post-Apply Corrections)" +status = "active" # active | completed +current_phase = 1 # 0 = pre-Phase 1; 1..N = in Phase N; "complete" if all phases done +last_updated = "2026-06-23" + +[blocked_by] +video_analysis_deob_apply_20260621 = "shipped 2026-06-23 (8f2e8a69)" + +[blocks] +# Pass 3 (projection to C11/Python code, future, user-led) is the next user-led track +video_analysis_deob_pass3_20260623 = "planned in video_analysis_deob_lexicon_v2_20260623" + +[phases] +phase_1 = { status = "pending", checkpointsha = "", name = "Apply v1->v2 corrections to 5 source files" } +phase_2 = { status = "pending", checkpointsha = "", name = "Verification (manual review + user approval)" } + +[tasks] +# Phase 1 (apply v1->v2 corrections) +t1_1 = { status = "pending", commit_sha = "", description = "Update lexicon.md: 8 corrections + 3 refinements + 4 template notations + 2 << / >> placements + new §10" } +t1_2 = { status = "pending", commit_sha = "", description = "Update terms_catalog.md: machine-readable rows for 17 changes" } +t1_3 = { status = "pending", commit_sha = "", description = "Update dedup_map.md: reshape Maps 1, 2, 3" } +t1_4 = { status = "pending", commit_sha = "", description = "Update prompt_template.md: Rule 5 encoding placeholder + remove wrong re-encodings + per-language << / >> note" } +t1_5 = { status = "pending", commit_sha = "", description = "Update report.md (warmup): §1.13 encoding placeholder + §3 lexicon v2 + §10 per-language rendering" } +t1_6 = { status = "pending", commit_sha = "", description = "Write V2_CHANGELOG.md: document 17 v1->v2 changes + out-of-scope" } +t1_7 = { status = "pending", commit_sha = "", description = "Update tracks.md: add v2 track row" } + +# Phase 2 (verification) +t2_1 = { status = "pending", commit_sha = "", description = "Manual review of all 17 changes (verify each is correctly applied; lossless preservation)" } +t2_2 = { status = "pending", commit_sha = "", description = "User review + approval; await Pass 3 dispatch" } + +[verification] +all_8_corrections = false +all_3_refinements = false +all_4_template_notations = false +both_lt_lt_placements = false +rule_5_placeholder = false +v2_changelog_documented = false +principled_user_specific_preserved = false +deliverables_not_reprocessed = false +no_src_changes = false +atomic_commits = false +user_approved = false +state_toml_completed = false + +[user_directives_logged] +set_is_data_structure = "Per user 2026-06-23: 'Do not reencode set as a kind. Set is a specific data structure, not directly related to an enumerable type.'" +function_vs_procedure = "Per user 2026-06-23: 'A procedure is not necessarily a function, a function has a specific declarative sense it, a procedure is imperative.'" +parameter_vs_argument = "Per user 2026-06-23: 'Parameters and arguments are also distinctly separate terms they are not analogous.'" +input_vs_arg = "Per user 2026-06-23: 'input and arg are also not the same.'" +type_genus_kind_analogous = "Per user 2026-06-23: 'Type/Genus/Kind are analogous no need to reencode.'" +proof_vs_construction = "Per user 2026-06-23: 'Proof and construction maybe, a construction is a type of proof.'" +transcendental_is_classification = "Per user 2026-06-23: 'transcendental is a classification of expressions that resolve to a specific sequence ... and fullfills very specific traits.'" +encoding_placeholder = "Per user 2026-06-23: 'Scalar as undefined resolution is fine. floatX when X defines a placeholder until user defines resolution. Int64 same thing, just call it integer until resolution definition is necessary.'" +encoding_taxonomy_useful = "Per user 2026-06-23: 'I do like the encoding taxonomy table you have when picking a resolution matters though.'" +ontology_procedure_shape = "Per user 2026-06-23: 'You can observe the shape of the procedure, not all possible result combinations or resolutions for a given metric utilized with that procedure. If the hedging is unrelated to this I need clarification.' [Tier 1 confirmed: hedging IS related]" +template_notation_b_default = "Per user 2026-06-23: 'B as default. Use c+++ template language style in specific scenario when the user asks for it or if its better. (same with jai/odin language have the lexicon support it if the user desires it).'" +much_less_much_greater = "Per user 2026-06-23: 'weakly_coupled(...) is good for c11, much_less and much_greater can be used as well.'" +near_n_operator = "Per user 2026-06-23: 'Maybe for near N can be near(N), or near N is near as the operator on N assigning attribution... can be used as infix A near B... or postfix A B near ... tolerance seems to clarify that you have a nearly equal float episilon.'" +stream_A_nat_arrow_A = "Per user 2026-06-23: 'Stream A = nat -> A is fine as potentially infinite process. To me infinite process is just something that is non-halting unless giving a specific termination quantifier or resolution.'" +applied_domain_c11_or_python = "Per user 2026-06-23: 'The applied domain is making a simple program in C11 or python that conveys what the subject video provides. If in c11 it should follow a specific covention of c11 that raddbg uses or that I use with my duffle libs in pikuma ps1 or the forth bootslop projects. For python it should use manual slop's convention.'" +user_specific_forms_annotation = "Per user 2026-06-23: 'Forms are somehwat annotation if its not code. When it comes to the code psuedo sectr lang is not complete and prob needs adapting or further adjustments.'" +hardware_target_up_to_10k = "Per user 2026-06-23: 'target up to 10k. when doing phase 3 you may have to pick a target, just do so within reason for a local hardware scenario (user with around a 1-10k USD budget I guess).'" +criteria_ok_may_ideate = "Per user 2026-06-23: 'criteria are ok, we'll have to ideate maybe for applied domain basd on how I elaborated if you don't think it matches enough.'" +pass3_artifact_code = "Per user 2026-06-23: 'pass 3 is markdown for docs, but its going to be code files that may or may not run.'" +report_v2_in_place = "Per user 2026-06-23: 'Also update the warmup report maybe.. make a report v2?' [Tier 1 interpretation: in-place update + REPORT_V2_CHANGELOG.md to document transition]" + +[no_day_estimates] +"Enforced per conductor/workflow.md Tier 1 Track Initialization Rules (added 2026-06-16). Scope measured in files (5 modified + 1 new changelog + 1 new track scaffold)."