8e3543d875
Grok's own recommendation (consulted 2026-06-11): 'xAI (Grok) | xAI official OpenAI-compatible (https://api.x.ai/v1) | Fully compatible and clean. Supports Grok-2 + Grok-2-Vision. No meaningful unique native surface lost by using the compatible endpoint.' This REVERSES the earlier 'xAI native' correction. The OpenAI- compatible approach for Grok is the canonical full-featured path; the implementation in Phase 3 (OpenAI SDK with base_url=https://api.x.ai/v1 + send_openai_compatible helper) is correct as-is. Updates to the spec: 1. §3.1.1: replaced the 'use xAI native' decision with the confirmed per-vendor table. Qwen=Native, Grok=OpenAI-Compatible (per Grok's own confirmation), MiniMax=OpenAI-Compatible, DeepSeek=OpenAI- Compatible, Ollama=OpenAI-Compatible-in-v1 (native in v2), Meta Llama API=Native (new 4th backend, follow-up), Gemini=Native (follow-up), Anthropic=Native (follow-up). Also added Grok's recommended v2 matrix field expansion: audio, video, grounding, computer_use, local, reasoning/extended_thinking, web_search, x_search, code_execution, file_search, mcp_support, structured_output. 2. §4.3: reverted from 'Grok via xAI (Native REST API)' back to 'Grok via xAI (OpenAI-Compatible) - confirmed 2026-06-11'. The implementation does NOT need a native refactor; the OpenAI SDK at https://api.x.ai/v1 is the canonical approach. Removed the earlier 'caching: true' entry from the registry (since the OpenAI-compat shim doesn't expose prompt_cache_key) and the 'no persistent client' state struct (back to the OpenAI SDK pattern). 3. §13.1.B: renamed from 'Native Vendor APIs' to 'Llama Native APIs (Ollama native + Meta Llama API)' and removed the Grok native refactor item (Grok says OpenAI-compat is fine). Kept the Ollama native + Meta Llama API items + matrix expansion. Clarified that Grok tests do NOT need rewriting; only Llama tests get 2 more (native Ollama, Meta Llama API). Net effect: the Phase 3 work that just shipped (Grok+Llama Green using OpenAI-compat shim) is CORRECT as-is. The implementation matches Grok's actual recommendation. No code rollback needed.