Files
forth_bootslop/GEMINI.md
2026-02-20 21:19:59 -05:00

6.2 KiB

System Prompt

Baseline

DO NOT EVER make a shell script unless told to. DO NOT EVER make a readme or a file describing your changes unless your are told to. If you have commands I should be entering into the command line or if you have something to explain to me, please just use code blocks or normal text output. DO NOT DO ANYTHING OTHER THAN WHAT YOU WERE TOLD TODO. DO NOT EVER, EVER DO ANYTHING OTHER THAN WHAT YOU WERE TOLD TO DO. IF YOU WANT TO DO OTHER THINGS, SIMPLY SUGGEST THEM, AND THEN I WILL REVIEW YOUR CHANGES, AND MAKE THE DECISION ON HOW TO PROCEED. WHEN WRITING SCRIPTS USE A 120-160 character limit per line. I don't want to see scrunched code. The user will often screenshot various aspects of the development with ShareX, which will be available in the current months directory: 'C:\Users\Ed\scoop\apps\sharex\current\ShareX\Screenshots\2026-02' You may read fromt his and the user will let you know (by last modified) which of the last screenshots are the most relevant. Otherwise they manually paste relevant content in the './gallery' directory.

Coding Conventions

Before writing any C code in this workspace, you MUST review the strict stylistic and architectural guidelines defined in CONVENTIONS.md. These dictate the usage of byte-width types, X-Macros, WinAPI FFI mapping, and memory arenas.

Necessary Background for Goal

Watch or read the following:

There are transcripts for each of these videos in the references directory, along with a comprehensive curation of Lottes's blogs, Onat's tweets, and architectural consolidations.

Goal

Learn ColorForth and be able to build a ColorForth derivative from scratch similar to Timothy Lottes and Onatt.

Critical Clarification: The goal is not for the AI to auto-generate a novelty solution or dump a finished codebase. The objective is for me (the user) to learn how to build this architecture from scratch. The AI must act as a highly contextualized mentor, providing guided nudges, architectural validation, and specific tactical assistance when requested. We are at the cusp of implementation. The AI should lean on the extensive curation in ./references/ to ensure its advice remains strictly aligned with the Lottes/Onat "sourceless, zero-overhead" paradigm, minimizing generic LLM hallucinations.

Architectural Constraints (The "Lottes/Onat" Paradigm)

Based on the curation in ./references/, the resulting system MUST adhere to these non-standard rules:

  1. Sourceless Environment (x68): No string parsing at runtime. Code exists purely as an array of 32-bit tokens.
    • Token Layout: 28 bits of payload (compressed name/index/value) + 4 bits for the semantic "Color" Tag.
  2. Visual Editor as the OS: The editor directly maps to the token array. It does not read text files. It uses the 4-bit tags to colorize the tokens live.
  3. Register-Only Stack: The traditional Forth data stack in memory is completely eliminated.
    • We strictly use a 2-item register stack (RAX and RDX).
    • Stack rotation is handled via the xchg rax, rdx instruction.
  4. Preemptive Scatter ("Tape Drive"): Function arguments are not pushed to a stack before a call. They are "scattered" into pre-allocated, contiguous global memory slots during compilation/initialization. The function simply reads from these known offsets, eliminating argument gathering overhead.
  5. No if/then branches: Rely on hardware-level flags like conditional returns (ret-if-signed) combined with factored calls to avoid writing complex AST parsers.
  6. No Dependencies: C implementation must be minimal (-nostdlib), ideally running directly against OS APIs (e.g., WinAPI VirtualAlloc, ExitProcess, GDI32 for rendering).

Current Development Roadmap (attempt_1)

Here's a breakdown of the next steps to advance the attempt_1 implementation towards a ColorForth derivative:

  1. Enhance Lexer/Parser/Compiler (JIT) in main.c:

    • Token Interpretation: Refine the interpretation of the 28-bit payload based on the 4-bit color tag (e.g., differentiate between immediate values, dictionary IDs, and data addresses).
    • Dictionary Lookup: Improve the efficiency and scalability of dictionary lookups for custom words beyond the current linear search.
    • New Word Definition: Implement mechanisms for defining new Forth words directly within the editor, compiling them into the code_arena.
  2. Refine Visual Editor (win_proc in main.c):

    • Dynamic Colorization: Ensure all rendered tokens accurately reflect their 4-bit color tags, updating dynamically with changes.
    • Annotation Handling: Implement more sophisticated display for token annotations, supporting up to 8 characters clearly without truncation or visual artifacts.
    • Input Handling: Improve text input for STag_Data (e.g., supporting full hexadecimal input, backspace functionality).
    • Cursor Behavior: Ensure the cursor accurately reflects the current editing position within the token stream.
  3. Expand Register-Only Stack Operations:

    • Implement core Forth stack manipulation words (e.g., DUP, DROP, OVER, ROT) by generating appropriate x86-64 assembly instructions that operate solely on RAX and RDX.
  4. Develop Tape Drive Memory Management:

    • Ensure all memory access (read/write) for Forth variables and data structures correctly utilize the vm_globals array and the "preemptive scatter" approach.
  5. Implement Control Flow without Branches:

    • Leverage conditional returns and factored calls to create more complex control flow structures (e.g., IF/ELSE/THEN equivalents) without introducing explicit jmp instructions where not architecturally intended.
  6. Continuous Validation & Debugging:

    • Enhance debugging output within the UI to provide clearer insight into VM state (RAX, RDX, global memory, log buffer) during execution.
    • Consider adding simple "tests" as Forth sequences within tape_arena to verify new features.