2 Commits

SHA1 Message Date
80629105ed Switch to RLM instead of ReAct 2026-01-21 19:57:30 -08:00
25c75bc89a Switch to RLM instead of ReAct 2026-01-21 18:53:07 -08:00
5 changed files with 311 additions and 49 deletions

README.md

@@ -1,6 +1,6 @@
# nanocode
-Minimal Claude Code alternative using DSPy ReAct! Single Python file, zero dependencies, ~250 lines.
Minimal Claude Code alternative using DSPy RLM! Single Python file, ~390 lines.
Built using Claude Code, then used to build itself.
@@ -8,57 +8,272 @@ Built using Claude Code, then used to build itself.
## Features
-- Full agentic loop with tool use
- Full agentic loop with tool use via [DSPy RLM](https://dspy.ai/)
- Tools: `read`, `write`, `edit`, `glob`, `grep`, `bash`
-- Conversation history
- Conversation history with context
- Colored terminal output
- **Modaic Integration**: Push, version, and share as a [Modaic](https://modaic.dev) autoprogram
-### OpenRouter
-Use [OpenRouter](https://openrouter.ai) to access any model:
---
## Prerequisites
Before using nanocode (or any DSPy RLM-based program), you need to install the Deno code interpreter:
```bash
brew install deno
```
This is required for the RLM's code execution capabilities.
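
You can verify the installation with:

```bash
deno --version
```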
---
## Quick Start
### Option 1: Use as a Modaic AutoProgram
Load and run nanocode directly from the Modaic Hub without cloning:
```python
from modaic import AutoProgram
# Load the precompiled nanocode agent from Modaic Hub
agent = AutoProgram.from_precompiled(
    "farouk1/nanocode",
    config={
        "lm": "openrouter/anthropic/claude-3.5-sonnet",
        "max_iters": 20,
    },
)
# Run a coding task
result = agent(task="What Python files are in this directory?")
print(result.answer)
print(result.affected_files)
```
### Option 2: Run Locally (Interactive CLI)
```bash
export OPENROUTER_API_KEY="your-key"
python nanocode.py
```
-To use a different model:
To use a specific model:
```bash
export OPENROUTER_API_KEY="your-key"
-export MODEL="openai/gpt-5.2"
export MODEL="openai/gpt-4"
python nanocode.py
```
-## Commands
-- `/c` - Clear conversation
-- `/q` or `exit` - Quit
---
## Configuration
When using nanocode as a Modaic AutoProgram, you can configure the following options:
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| `lm` | str | `openrouter/anthropic/claude-3.5-sonnet` | Primary language model |
| `sub_lm` | str | `openrouter/openai/gpt-4.1` | Sub-LM used for the RLM's `llm_query` calls |
| `max_iters` | int | `20` | Maximum agent iterations |
| `api_base` | str | `https://openrouter.ai/api/v1` | API base URL |
| `max_tokens` | int | `32000` | Maximum tokens per request |
| `max_output_chars` | int | `100000` | Maximum output character limit |
| `verbose` | bool | `False` | Enable verbose logging |
| `track_usage` | bool | `True` | Track token usage on LM calls |
Example with custom configuration:
```python
from modaic import AutoProgram
agent = AutoProgram.from_precompiled(
    "farouk1/nanocode",
    config={
        "lm": "openrouter/openai/gpt-4",
        "sub_lm": "openrouter/openai/gpt-3.5-turbo",
        "max_iters": 30,
        "max_tokens": 8000,
        "verbose": True,
    },
)
```
---
## CLI Commands
| Command | Description |
|---------|-------------|
| `/c` | Clear conversation history |
| `/model` | Switch the active model interactively |
| `/q` or `exit` | Quit the application |
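
For instance, `/model` opens an interactive picker. The session below is sketched from the implementation; the numbered entries depend on the configured `AVAILABLE_MODELS` and are shown here as placeholders:

```
❯ /model

Current model: openrouter/anthropic/claude-3.5-sonnet

Select a new model:
  1. <model name> (<model id>)
  c. Custom model (enter manually)
  k. Keep current model

❯ Enter choice: c
❯ Enter model ID: openrouter/openai/gpt-4
⏺ Switched to custom model: openrouter/openai/gpt-4
```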
---
## Tools
-| Tool | Description |
-|------|-------------|
-| `read` | Read file with line numbers, offset/limit |
-| `write` | Write content to file |
-| `edit` | Replace string in file (must be unique) |
-| `glob` | Find files by pattern, sorted by mtime |
-| `grep` | Search files for regex |
-| `bash` | Run shell command |
-## Example
The agent has access to the following tools:
| Tool | Function | Description |
|------|----------|-------------|
| `read_file` | `read_file(path, offset, limit)` | Read file contents with line numbers |
| `write_file` | `write_file(path, content)` | Write content to a file |
| `edit_file` | `edit_file(path, old, new, replace_all)` | Replace text in a file (old must be unique unless `replace_all=True`) |
| `glob_files` | `glob_files(pattern, path)` | Find files matching a glob pattern, sorted by modification time |
| `grep_files` | `grep_files(pattern, path)` | Search files for a regex pattern |
| `run_bash` | `run_bash(cmd)` | Run a shell command and return output |
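
The tools are plain Python functions in `nanocode.py`, so they can also be called directly, outside the agent loop. A minimal sketch, assuming you run it from the repo root (signatures follow the table above):

```python
from nanocode import glob_files, grep_files, read_file

# Newest-first listing of Python files under the current directory
print(glob_files("**/*.py", "."))

# Regex search for function definitions mentioning "file"
print(grep_files(r"def \w*file\w*", "."))

# First 40 lines of the main module, rendered with line numbers
print(read_file("nanocode.py", 0, 40))
```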
---
## Example Usage
### Interactive CLI
```
────────────────────────────────────────
what files are here?
────────────────────────────────────────
-Glob(**/*.py)
-⎿ nanocode.py
-There's one Python file: nanocode.py
Thinking...
⏺ globfiles(pattern='**/*', path='.')
I found the following files:
- nanocode.py
- README.md
- modaic/SKILL.md
```
### Programmatic Usage
```python
from modaic import AutoProgram
agent = AutoProgram.from_precompiled("farouk1/nanocode")
# Read a file
result = agent(task="Read the first 10 lines of nanocode.py")
print(result.answer)
# Search for patterns
result = agent(task="Find all functions that contain 'file' in their name")
print(result.answer)
# Make edits
result = agent(task="Add a comment at the top of README.md")
print(result.affected_files) # ['README.md']
```
---
## Architecture
### Overview
```
nanocode.py
├── File Operations
│   ├── read_file()  - Read with line numbers
│   ├── write_file() - Write content
│   └── edit_file()  - Find & replace
├── Search Operations
│   ├── glob_files() - Pattern matching
│   └── grep_files() - Regex search
├── Shell Operations
│   └── run_bash()   - Execute commands
├── DSPy Components
│   ├── CodingAssistant (Signature)
│   └── RLMCodingProgram (PrecompiledProgram)
└── Modaic Integration
    └── RLMCodingConfig (PrecompiledConfig)
```
### Key Classes
#### `RLMCodingConfig`
Configuration class extending `PrecompiledConfig` for experiment-specific parameters.
```python
class RLMCodingConfig(PrecompiledConfig):
    max_iters: int = 20
    lm: str = "openrouter/anthropic/claude-3.5-sonnet"
    sub_lm: str = "openrouter/openai/gpt-4.1"
    api_base: str = "https://openrouter.ai/api/v1"
    max_tokens: int = 32000
    max_output_chars: int = 100000
    verbose: bool = False
    track_usage: bool = True
```
#### `RLMCodingProgram`
Main program class extending `PrecompiledProgram`. Wraps a DSPy RLM agent with coding tools.
```python
class RLMCodingProgram(PrecompiledProgram):
    config: RLMCodingConfig

    def forward(self, task: str) -> dspy.Prediction:
        # Returns prediction with .answer and .affected_files
        return self.agent(task=task)
```
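
A minimal local-construction sketch, mirroring what `main()` does without the interactive loop (assumes `OPENROUTER_API_KEY` is set and you are in the repo root):

```python
from nanocode import RLMCodingConfig, RLMCodingProgram

# Build the agent directly; unset fields fall back to RLMCodingConfig defaults
config = RLMCodingConfig(max_iters=30, verbose=True)
agent = RLMCodingProgram(config)

result = agent(task="Summarize what nanocode.py does")
print(result.answer)
print(result.affected_files)
```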
#### `CodingAssistant`
DSPy Signature defining the agent's input/output schema.
```python
class CodingAssistant(dspy.Signature):
    task: str = dspy.InputField()
    answer: str = dspy.OutputField()
    affected_files: list[str] = dspy.OutputField()
```
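
For reference, `RLMCodingProgram.__init__` wires this signature into the RLM roughly as follows (condensed from `nanocode.py`; `config` is an `RLMCodingConfig` instance and the tool dict is abbreviated here):

```python
import dspy

# Sub-LM that serves the RLM's llm_query calls
sub_lm = dspy.LM(
    config.sub_lm,
    api_base=config.api_base,
    max_tokens=config.max_tokens,
    track_usage=config.track_usage,
)

# The RLM drives CodingAssistant and exposes the coding tools
agent = dspy.RLM(
    CodingAssistant,
    sub_lm=sub_lm,
    tools={"read_file": read_file, "run_bash": run_bash},  # full set omitted
    max_output_chars=config.max_output_chars,
    max_iterations=config.max_iters,
    verbose=config.verbose,
)
```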
---
## Publishing Your Own Version
If you modify nanocode and want to publish your own version to Modaic Hub:
```python
from nanocode import RLMCodingProgram, RLMCodingConfig
# Create and optionally optimize your program
program = RLMCodingProgram(RLMCodingConfig())
# Push to your Modaic Hub repo
program.push_to_hub(
    "your-username/my-nanocode",
    commit_message="My customized nanocode",
    with_code=True,  # Include source code for AutoProgram loading
)
```
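
Anyone can then load your fork back the same way the stock agent is loaded (sketch; the repo name matches the push above):

```python
from modaic import AutoProgram

# Load your published fork instead of the upstream repo
agent = AutoProgram.from_precompiled("your-username/my-nanocode")
result = agent(task="List the TODO comments in this repo")
print(result.answer)
```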
---
## Dependencies
- [DSPy](https://dspy.ai/) - Framework for programming language models
- [Modaic](https://modaic.dev/) - Hub for sharing and versioning DSPy programs
- OpenRouter API key (for accessing language models)
Install dependencies:
```bash
pip install dspy modaic
# or with uv
uv add dspy modaic
```
---
## Environment Variables
| Variable | Required | Description |
|----------|----------|-------------|
| `OPENROUTER_API_KEY` | Yes | Your OpenRouter API key |
| `MODEL` | No | Override the default model selection |
| `MODAIC_TOKEN` | For Hub | Required for pushing/loading from Modaic Hub |
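
A typical shell setup for local use plus Hub publishing (all values are placeholders):

```bash
export OPENROUTER_API_KEY="sk-or-..."    # required
export MODEL="openrouter/openai/gpt-4"   # optional model override
export MODAIC_TOKEN="..."                # only needed for Modaic Hub push/load
python nanocode.py
```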
---
## License
MIT


@@ -1,10 +1,11 @@
{
  "model": null,
  "max_iters": 20,
-  "lm": "openai/gpt-5.2-codex",
  "lm": "openrouter/anthropic/claude-3.5-sonnet",
  "sub_lm": "openrouter/openai/gpt-4.1",
  "api_base": "https://openrouter.ai/api/v1",
-  "max_tokens": 16000,
  "max_tokens": 32000,
  "max_output_chars": 100000,
-  "verbose": false
  "verbose": false,
  "track_usage": true
}

nanocode.py

@@ -234,16 +234,6 @@ class CodingAssistant(dspy.Signature):
)

-tools = {
-    "readfile": read_file,
-    "writefile": write_file,
-    "editfile": edit_file,
-    "globfiles": glob_files,
-    "grepfiles": grep_files,
-    "runbash": run_bash,
-}

class ToolLoggingCallback(BaseCallback):
    """Callback that logs tool calls as they happen."""
@@ -278,9 +268,10 @@ class RLMCodingConfig(PrecompiledConfig):
    lm: str = "openrouter/anthropic/claude-3.5-sonnet"  # Default fallback
    sub_lm: str = "openrouter/openai/gpt-4.1"  # Default fallback
    api_base: str = "https://openrouter.ai/api/v1"
-    max_tokens: int = 16000
    max_tokens: int = 32000
    max_output_chars: int = 100000
    verbose: bool = False
    track_usage: bool = True

class RLMCodingProgram(PrecompiledProgram):
@@ -289,6 +280,14 @@ class RLMCodingProgram(PrecompiledProgram):
    def __init__(self, config: RLMCodingConfig, **kwargs):
        self.config = config
        super().__init__(config, **kwargs)
        self.tools = {
            "read_file": read_file,
            "write_file": write_file,
            "edit_file": edit_file,
            "glob_files": glob_files,
            "grep_files": grep_files,
            "run_bash": run_bash,
        }
        # tool logging for introspection on multi-turn conversations
        dspy.settings.configure(callbacks=[ToolLoggingCallback()])
@@ -296,16 +295,18 @@ class RLMCodingProgram(PrecompiledProgram):
            self.config.lm,
            api_base=self.config.api_base,
            max_tokens=self.config.max_tokens,
            track_usage=self.config.track_usage,
        )
        sub_lm = dspy.LM(
            self.config.sub_lm,
            api_base=self.config.api_base,
            max_tokens=self.config.max_tokens,
            track_usage=self.config.track_usage,
        )
        agent = dspy.RLM(
            CodingAssistant,
            sub_lm=sub_lm,
-            tools=tools,
            tools=self.tools,
            max_output_chars=self.config.max_output_chars,
            max_iterations=self.config.max_iters,
            verbose=self.config.verbose,
@@ -318,6 +319,15 @@ class RLMCodingProgram(PrecompiledProgram):
        assert task, "Task cannot be empty"
        return self.agent(task=task)

    def get_tools(self):
        return self.tools

    def set_tool(self, name: str, tool: callable):
        self.tools[name] = tool

    def remove_tool(self, name: str):
        del self.tools[name]

def main():
    model = os.getenv("MODEL")
    if model is None:
@@ -332,7 +342,7 @@ def main():
    agent = RLMCodingProgram(config)
    print(
-        f"{BOLD}nanocode-dspy{RESET} | {DIM}{agent.config.lm} | {os.getcwd()}{RESET}\n"
        f"{BOLD}NANOCODE DSPY{RESET} | {DIM}{agent.config.lm} | {os.getcwd()}{RESET}\n"
    )

    # Conversation history for context
@@ -352,6 +362,37 @@ def main():
            history = []
            print(f"{GREEN}⏺ Cleared conversation{RESET}")
            continue
        if user_input == "/model":
            print(f"\n{BOLD}Current model: {agent.config.lm}{RESET}")
            print(f"\n{BOLD}Select a new model:{RESET}")
            for key, (name, model_id) in AVAILABLE_MODELS.items():
                print(f"  {BLUE}{key}{RESET}. {name} ({DIM}{model_id}{RESET})")
            print(f"  {BLUE}c{RESET}. Custom model (enter manually)")
            print(f"  {BLUE}k{RESET}. Keep current model")
            choice = input(f"\n{BOLD}{BLUE}❯{RESET} Enter choice: ").strip().lower()
            if choice == "k":
                print(f"{GREEN}⏺ Keeping current model: {agent.config.lm}{RESET}")
                continue
            elif choice in AVAILABLE_MODELS:
                name, model_id = AVAILABLE_MODELS[choice]
                new_model = model_id if model_id.startswith("openrouter/") else f"openrouter/{model_id}"
                config.lm = new_model
                agent = RLMCodingProgram(config)
                print(f"{GREEN}⏺ Switched to: {name} ({new_model}){RESET}")
            elif choice == "c":
                custom_model = input(f"{BOLD}{BLUE}❯{RESET} Enter model ID: ").strip()
                if custom_model:
                    new_model = custom_model if custom_model.startswith("openrouter/") else f"openrouter/{custom_model}"
                    config.lm = new_model
                    agent = RLMCodingProgram(config)
                    print(f"{GREEN}⏺ Switched to custom model: {new_model}{RESET}")
                else:
                    print(f"{RED}⏺ Invalid model ID, keeping current model{RESET}")
            else:
                print(f"{RED}⏺ Invalid choice, keeping current model{RESET}")
            continue
        # Build context from history
        context = f"Working directory: {os.getcwd()}\n"
@@ -364,11 +405,14 @@ def main():
        print(f"\n{CYAN}{RESET} Thinking...", flush=True)

-        # Run the ReAct agent
        # Run the RLM agent
        result = agent(task=task)

        # Display the answer
        print(f"\n{CYAN}{RESET} {render_markdown(result.answer)}")

        # Display usage
        print(f"\n{MAGENTA}⏺ Debug Prediction: {result}{RESET}")

        # Save to history
        history.append({"user": user_input, "assistant": result.answer})
@@ -385,6 +429,6 @@ def main():
if __name__ == "__main__":
-    agent = RLMCodingProgram(RLMCodingConfig(lm="openai/gpt-5.2-codex"))
    agent = RLMCodingProgram(RLMCodingConfig())
-    agent.push_to_hub(MODAIC_REPO_PATH, commit_message="Switch to RLM instead of ReAct", tag="v0.0.1")
    agent.push_to_hub(MODAIC_REPO_PATH, commit_message="Switch to RLM instead of ReAct", tag="v0.0.3")
    #main()


@@ -4,7 +4,7 @@
"train": [], "train": [],
"demos": [], "demos": [],
"signature": { "signature": {
"instructions": "You are a concise coding assistant. Help the user with their coding task by using the available tools to read, write, edit files, search the codebase, and run commands.\n\nYou are tasked with producing the following outputs given the inputs `task`:\n- {answer}\n- {affected_files} # note: the value you produce must adhere to the JSON schema: {\"type\": \"array\", \"items\": {\"type\": \"string\"}}\n\nYou have access to a Python REPL environment. Write Python code and it will be executed. You will see the output, then write more code based on what you learned. This is an iterative process.\n\nAvailable:\n- Variables: `task` (your input data)\n- `llm_query(prompt)` - query a sub-LLM (~500K char capacity) for semantic analysis\n- `llm_query_batched(prompts)` - query multiple prompts concurrently (much faster for multiple queries)\n- `print()` - ALWAYS print to see results\n- `SUBMIT(answer, affected_files)` - submit final output when done\n- Standard libraries: re, json, collections, math, etc.\n\nIMPORTANT: This is ITERATIVE. Each code block you write will execute, you'll see the output, then you decide what to do next. Do NOT try to solve everything in one step.\n\n1. EXPLORE FIRST - Look at your data before processing it. Print samples, check types/lengths, understand the structure.\n2. ITERATE - Write small code snippets, observe outputs, then decide next steps. State persists between iterations.\n3. VERIFY BEFORE SUBMITTING - If results seem wrong (zeros, empty, unexpected), reconsider your approach.\n4. USE llm_query FOR SEMANTICS - String matching finds WHERE things are; llm_query understands WHAT things mean.\n5. MINIMIZE RETYPING (INPUTS & OUTPUTS) - When values are long, precise, or error-prone (IDs, numbers, code, quotes), re-access them via variables and parse/compute in code instead of retyping. Use small, targeted prints to sanity-check, but avoid manual copying when variables can carry the exact value.\n6. SUBMIT ONLY AFTER SEEING OUTPUTS - SUBMIT ends the current run immediately. If you need to inspect printed output, run it in one step, review the result, then call SUBMIT in a later step.\n\nYou have max 50 sub-LLM calls. When done, call SUBMIT() with your output.\nAdditional tools available (use these instead of standard library equivalents):\n- `readfile(path: str, offset: int, limit: int) -> str` - Read file contents with line numbers.\n- `writefile(path: str, content: str) -> str` - Write content to a file.\n- `editfile(path: str, old: str, new: str, replace_all: bool) -> str` - Replace text in a file.\n- `globfiles(pattern: str, path: str) -> str` - Find files matching a glob pattern, sorted by modification time.\n- `grepfiles(pattern: str, path: str) -> str` - Search files for a regex pattern.\n- `runbash(cmd: str) -> str` - Run a shell command and return output.", "instructions": "You are a concise coding assistant. Help the user with their coding task by using the available tools to read, write, edit files, search the codebase, and run commands.\n\nYou are tasked with producing the following outputs given the inputs `task`:\n- {answer}\n- {affected_files} # note: the value you produce must adhere to the JSON schema: {\"type\": \"array\", \"items\": {\"type\": \"string\"}}\n\nYou have access to a Python REPL environment. Write Python code and it will be executed. You will see the output, then write more code based on what you learned. 
This is an iterative process.\n\nAvailable:\n- Variables: `task` (your input data)\n- `llm_query(prompt)` - query a sub-LLM (~500K char capacity) for semantic analysis\n- `llm_query_batched(prompts)` - query multiple prompts concurrently (much faster for multiple queries)\n- `print()` - ALWAYS print to see results\n- `SUBMIT(answer, affected_files)` - submit final output when done\n- Standard libraries: re, json, collections, math, etc.\n\nIMPORTANT: This is ITERATIVE. Each code block you write will execute, you'll see the output, then you decide what to do next. Do NOT try to solve everything in one step.\n\n1. EXPLORE FIRST - Look at your data before processing it. Print samples, check types/lengths, understand the structure.\n2. ITERATE - Write small code snippets, observe outputs, then decide next steps. State persists between iterations.\n3. VERIFY BEFORE SUBMITTING - If results seem wrong (zeros, empty, unexpected), reconsider your approach.\n4. USE llm_query FOR SEMANTICS - String matching finds WHERE things are; llm_query understands WHAT things mean.\n5. MINIMIZE RETYPING (INPUTS & OUTPUTS) - When values are long, precise, or error-prone (IDs, numbers, code, quotes), re-access them via variables and parse/compute in code instead of retyping. Use small, targeted prints to sanity-check, but avoid manual copying when variables can carry the exact value.\n6. SUBMIT ONLY AFTER SEEING OUTPUTS - SUBMIT ends the current run immediately. If you need to inspect printed output, run it in one step, review the result, then call SUBMIT in a later step.\n\nYou have max 50 sub-LLM calls. When done, call SUBMIT() with your output.\nAdditional tools available (use these instead of standard library equivalents):\n- `read_file(path: str, offset: int, limit: int) -> str` - Read file contents with line numbers.\n- `write_file(path: str, content: str) -> str` - Write content to a file.\n- `edit_file(path: str, old: str, new: str, replace_all: bool) -> str` - Replace text in a file.\n- `glob_files(pattern: str, path: str) -> str` - Find files matching a glob pattern, sorted by modification time.\n- `grep_files(pattern: str, path: str) -> str` - Search files for a regex pattern.\n- `run_bash(cmd: str) -> str` - Run a shell command and return output.",
"fields": [ "fields": [
{ {
"prefix": "Variables Info:", "prefix": "Variables Info:",
@@ -29,7 +29,7 @@
      ]
    },
    "lm": {
"model": "openai/gpt-5.2-codex", "model": "openrouter/anthropic/claude-3.5-sonnet",
"model_type": "chat", "model_type": "chat",
"cache": true, "cache": true,
"num_retries": 3, "num_retries": 3,
@@ -37,8 +37,9 @@
"launch_kwargs": {}, "launch_kwargs": {},
"train_kwargs": {}, "train_kwargs": {},
"temperature": null, "temperature": null,
"max_tokens": 16000, "max_tokens": 32000,
"api_base": "https://openrouter.ai/api/v1" "api_base": "https://openrouter.ai/api/v1",
"track_usage": true
} }
}, },
"agent.extract": { "agent.extract": {
@@ -67,7 +68,7 @@
      ]
    },
    "lm": {
-      "model": "openai/gpt-5.2-codex",
      "model": "openrouter/anthropic/claude-3.5-sonnet",
      "model_type": "chat",
      "cache": true,
      "num_retries": 3,
@@ -75,8 +76,9 @@
"launch_kwargs": {}, "launch_kwargs": {},
"train_kwargs": {}, "train_kwargs": {},
"temperature": null, "temperature": null,
"max_tokens": 16000, "max_tokens": 32000,
"api_base": "https://openrouter.ai/api/v1" "api_base": "https://openrouter.ai/api/v1",
"track_usage": true
} }
}, },
"metadata": { "metadata": {

pyproject.toml

@@ -4,4 +4,4 @@ version = "0.1.0"
description = "Add your description here"
readme = "README.md"
requires-python = ">=3.13"
-dependencies = ["dspy>=3.1.2", "modaic>=0.10.3"]
dependencies = ["dspy>=3.1.2", "modaic>=0.10.4"]