Private
Public Access
0
0
Commit Graph

16 Commits

Author SHA1 Message Date
ed 1bea0d23bf fix(test): correct filename typo manualslop.toml -> manual_slop.toml in project switch
Tier 2's project-switch fix (commit 455c17ff) was correct but used
'manualslop.toml' (no underscore) instead of 'manual_slop.toml'. The
if Path(workspace_toml).exists() check was False, so the switch was
silently skipped — the subprocess stayed on whatever stale project a
prior test left, and the RAG engine used the wrong base_dir.

Fixing the filename makes the project switch actually fire. The test
now passes 4/4 runs in isolation (6-7s each). The RAG context block
appears in the discussion history as expected.
2026-06-28 09:24:06 -04:00
ed 3c7455fdbe test(rag): wait for files setter before triggering RAG sync
The set_value('files', ...) call is async (push_event -> pending_gui_tasks
-> render loop). The RAG setters (rag_enabled, rag_source, rag_emb_provider)
are also async and each triggers a RAG sync via submit_io. The syncs and
the files setter are NOT ordered: the sync may fire before the files
setter is processed, in which case the sync sees self.files == [] and
skips the rebuild (RAG sync only triggers the rebuild if both
is_empty() AND self.files are truthy).

Fix: poll get_value('files') until the expected value is reflected,
guaranteeing the files setter is processed before the RAG setters
trigger their syncs. Belt-and-suspenders alongside the project-switch
fix from the previous commit.

The test was passing in 4d2a6666 because of timing; the project
switch added latency, so the race is now exposed.
2026-06-28 00:01:22 -04:00
ed 455c17ffb2 test(rag): switch to workspace project explicitly before configuring RAG
Per Tier 1 addendum 3 (the real defect): tests hotpatch individual state
fields via set_value instead of calling the proper project-switch
flow. The session-scoped subprocess may be on a stale project from a
prior test (e.g. test_context_sim_live switches to
temp_livecontextsim.toml and never switches back). The RAG engine uses
active_project_root (derived from active_project_path) as its base_dir,
NOT ui_files_base_dir. So hotpatching files/rag_enabled via set_value
while active_project_path is stale leaves the RAG engine looking at a
dead dir.

Fix: switch to the workspace project explicitly at the start of the
test (like a user would) using client.push_event('custom_callback',
...) + client.wait_for_project_switch(...). The path must be absolute
because the subprocess's CWD is the workspace, so a relative path
like 'tests/artifacts/.../manualslop.toml' would resolve to the wrong
dir from the subprocess's CWD.

Verified: the switch fires successfully (no WARNING printed). But the
RAG search still returns 0 chunks — the index_file rebuild is not
adding the files. The exact cause is still under investigation.

This is the proper fix per Tier 1 (NOT "delete stale files" which
treats the symptom). The sim tests' teardown() also needs a switch-back
to the workspace project (separate track).
2026-06-27 23:55:41 -04:00
ed 4d2a6666a4 fix(rag): convert RAGChunk to dict in _rag_search_result to match type contract
The RAG engine's search() returns List[RAGChunk] (dataclass instances),
but _rag_search_result's return type is Result[list[Metadata]] (a list
of dicts). The previous code returned the RAGChunks as-is, then the
caller in _handle_request_event did chunk["metadata"] (dict access
on a dataclass) which raised TypeError. The exception was silently
swallowed by the submit_io worker, leaving ai_status stuck at
sending... for the full 50-second test poll before failing.

Two surgical changes:
1. _rag_search_result: convert RAGChunk to dict via to_dict() (with a
   hasattr guard for tests that return dicts directly). Matches the
   function's documented return type.
2. _handle_request_event: use isinstance guards + dict.get() on the
   chunk fields. Defensive against the type mismatch and matches the
   dict contract.

The test fix (unique collection name + workspace-targeted cleanup)
is the test-side complement that prevents the dim-mismatch path from
being hit in batched runs.

Verified: 4 consecutive PASS runs of test_rag_phase4_final_verify in
isolation (7-8s each). 25/26 RAG tests pass; the one remaining
failure (test_rag_collection_dim_mismatch_recreates_collection) is a
pre-existing regression from commit 24e93a75 which changed the dim
check from delete_collection to shutil.rmtree without updating the
test mock setup. Out of scope for this fix.
2026-06-27 20:58:36 -04:00
ed 5a9b8d6891 fix(test+rag): clean chroma cache pre-test + add INVESTIGATE stderr for RAG init 2026-06-10 17:20:57 -04:00
ed 8f7de45aca fix(rag): robust test polling for entry race + stress test timing tolerance 2026-06-10 14:43:27 -04:00
ed 15ffc3a34f fix(rag): make test assertion accept either file's content (robust to chroma ordering) 2026-06-10 13:53:52 -04:00
ed 1cd3444e4c test(rag): mark RAG tests with clean_baseline for batch isolation 2026-06-09 16:56:55 -04:00
ed 006bb11488 refactor(test): 5 test files use live_gui_workspace fixture instead of hardcoded path 2026-06-09 16:14:40 -04:00
ed c96bdb06ba test(rag_phase4): handle None status before .lower() in error check 2026-06-05 12:38:47 -04:00
ed e2305ff49a Antigravity is dog shit. 2026-05-20 07:51:58 -04:00
ed 7f2f9c1989 fix: Robustness improvements for RAG tests and GUI stability
- Added import sys to src/api_hook_client.py.
- Fixed App.__getattr__ to use direct attribute access on controller to avoid recursion.
- Simplified _get_app_attr and _has_app_attr in src/api_hooks.py.
- Centralized RAG and symbol enrichment in AppController._handle_request_event.
- Updated 	ests/test_symbol_parsing.py to match the new enrichment flow.
- Removed redundant task appending from i_status and mma_status setters.
- Improved _sync_rag_engine to only set 'ready' status after indexing is confirmed.
- Updated 	est_status_encapsulation.py to reflect setter changes.
2026-05-15 17:17:05 -04:00
ed 2d76381796 fix(rag): Resolve RAG test failures and race conditions
- Fixed circular import in chromadb by using lazy imports in 
ag_engine.py.
- Moved RAG engine initialization to background threads in AppController to avoid blocking UI.
- Added _rag_engine_lock to prevent race conditions during engine re-initialization.
- Updated Gemini embedding model to gemini-embedding-001 (available) from 	ext-embedding-004 (not found).
- Fixed _rebuild_rag_index to use fresh 
ag_engine instance from self in every iteration.
- Optimized 	est_rag_phase4_final_verify.py and 	est_rag_phase4_stress.py to wait for RAG sync before continuing.
- Added dummy embedding fallback in LocalEmbeddingProvider if sentence-transformers fails to load.
2026-05-14 22:23:48 -04:00
ed 7974f661b3 fix(phase6): resolve minimax regression and context snapshotting crash 2026-05-10 14:58:29 -04:00
ed b958fa2819 refactor(phase5): Comprehensive stabilisation pass. De-duplicated App/Controller state, hardened session reset, and updated integration tests with deterministic polling. 2026-05-09 16:55:45 -04:00
ed 7bed4a8f97 conductor(checkpoint): Final checkpoint for RAG Support track - Phase 4 complete 2026-05-04 22:36:31 -04:00