2 Commits

SHA1 Message Date
80629105ed Switch to RLM instead of ReAct 2026-01-21 19:57:30 -08:00
25c75bc89a Switch to RLM instead of ReAct 2026-01-21 18:53:07 -08:00
5 changed files with 311 additions and 49 deletions

README.md

@@ -1,6 +1,6 @@
# nanocode
-Minimal Claude Code alternative using DSPy ReAct! Single Python file, zero dependencies, ~250 lines.
Minimal Claude Code alternative using DSPy RLM! Single Python file, ~390 lines.
Built using Claude Code, then used to build itself.
@@ -8,57 +8,272 @@ Built using Claude Code, then used to build itself.
## Features
-- Full agentic loop with tool use
- Full agentic loop with tool use via [DSPy RLM](https://dspy.ai/)
- Tools: `read`, `write`, `edit`, `glob`, `grep`, `bash`
-- Conversation history
- Conversation history with context
- Colored terminal output
- **Modaic Integration**: Push, version, and share as a [Modaic](https://modaic.dev) autoprogram
-### OpenRouter
-Use [OpenRouter](https://openrouter.ai) to access any model:
---
## Prerequisites
Before using nanocode (or any DSPy RLM-based program), you need to install the Deno code interpreter:
```bash
brew install deno
```
This is required for the RLM's code execution capabilities.
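
You can verify the installation with:

```bash
deno --version
```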
---
## Quick Start
### Option 1: Use as a Modaic AutoProgram
Load and run nanocode directly from the Modaic Hub without cloning:
```python
from modaic import AutoProgram
# Load the precompiled nanocode agent from Modaic Hub
agent = AutoProgram.from_precompiled(
    "farouk1/nanocode",
    config={
        "lm": "openrouter/anthropic/claude-3.5-sonnet",
        "max_iters": 20,
    },
)
# Run a coding task
result = agent(task="What Python files are in this directory?")
print(result.answer)
print(result.affected_files)
```
### Option 2: Run Locally (Interactive CLI)
```bash
export OPENROUTER_API_KEY="your-key"
python nanocode.py
```
-To use a different model:
To use a specific model:
```bash
export OPENROUTER_API_KEY="your-key"
-export MODEL="openai/gpt-5.2"
export MODEL="openai/gpt-4"
python nanocode.py
```
-## Commands
-- `/c` - Clear conversation
-- `/q` or `exit` - Quit
---
## Configuration
When using nanocode as a Modaic AutoProgram, you can configure the following options:
| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| `lm` | str | `openrouter/anthropic/claude-3.5-sonnet` | Primary language model |
| `sub_lm` | str | `openrouter/openai/gpt-4.1` | Sub-LM used for the RLM's `llm_query` calls |
| `max_iters` | int | `20` | Maximum agent iterations |
| `api_base` | str | `https://openrouter.ai/api/v1` | API base URL |
| `max_tokens` | int | `32000` | Maximum tokens per request |
| `max_output_chars` | int | `100000` | Maximum output character limit |
| `verbose` | bool | `False` | Enable verbose logging |
| `track_usage` | bool | `True` | Track token usage on LM calls |
Example with custom configuration:
```python
from modaic import AutoProgram
agent = AutoProgram.from_precompiled(
    "farouk1/nanocode",
    config={
        "lm": "openrouter/openai/gpt-4",
        "sub_lm": "openrouter/openai/gpt-3.5-turbo",
        "max_iters": 30,
        "max_tokens": 8000,
        "verbose": True,
    },
)
```
---
## CLI Commands
| Command | Description |
|---------|-------------|
| `/c` | Clear conversation history |
| `/model` | Switch the active model interactively |
| `/q` or `exit` | Quit the application |
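
For instance, `/model` opens an interactive picker. The session below is sketched from the implementation; the numbered entries depend on the configured `AVAILABLE_MODELS` and are shown here as placeholders:

```
❯ /model

Current model: openrouter/anthropic/claude-3.5-sonnet

Select a new model:
  1. <model name> (<model id>)
  c. Custom model (enter manually)
  k. Keep current model

❯ Enter choice: c
❯ Enter model ID: openrouter/openai/gpt-4
⏺ Switched to custom model: openrouter/openai/gpt-4
```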
---
## Tools
-| Tool | Description |
-|------|-------------|
-| `read` | Read file with line numbers, offset/limit |
-| `write` | Write content to file |
-| `edit` | Replace string in file (must be unique) |
-| `glob` | Find files by pattern, sorted by mtime |
-| `grep` | Search files for regex |
-| `bash` | Run shell command |
-## Example
The agent has access to the following tools:
| Tool | Function | Description |
|------|----------|-------------|
| `read_file` | `read_file(path, offset, limit)` | Read file contents with line numbers |
| `write_file` | `write_file(path, content)` | Write content to a file |
| `edit_file` | `edit_file(path, old, new, replace_all)` | Replace text in a file (old must be unique unless `replace_all=True`) |
| `glob_files` | `glob_files(pattern, path)` | Find files matching a glob pattern, sorted by modification time |
| `grep_files` | `grep_files(pattern, path)` | Search files for a regex pattern |
| `run_bash` | `run_bash(cmd)` | Run a shell command and return output |
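
The tools are plain Python functions in `nanocode.py`, so they can also be called directly, outside the agent loop. A minimal sketch, assuming you run it from the repo root (signatures follow the table above):

```python
from nanocode import glob_files, grep_files, read_file

# Newest-first listing of Python files under the current directory
print(glob_files("**/*.py", "."))

# Regex search for function definitions mentioning "file"
print(grep_files(r"def \w*file\w*", "."))

# First 40 lines of the main module, rendered with line numbers
print(read_file("nanocode.py", 0, 40))
```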
---
## Example Usage
### Interactive CLI
```
────────────────────────────────────────
what files are here?
────────────────────────────────────────
-Glob(**/*.py)
-⎿ nanocode.py
-There's one Python file: nanocode.py
Thinking...
⏺ globfiles(pattern='**/*', path='.')
I found the following files:
- nanocode.py
- README.md
- modaic/SKILL.md
```
### Programmatic Usage
```python
from modaic import AutoProgram
agent = AutoProgram.from_precompiled("farouk1/nanocode")
# Read a file
result = agent(task="Read the first 10 lines of nanocode.py")
print(result.answer)
# Search for patterns
result = agent(task="Find all functions that contain 'file' in their name")
print(result.answer)
# Make edits
result = agent(task="Add a comment at the top of README.md")
print(result.affected_files) # ['README.md']
```
---
## Architecture
### Overview
```
nanocode.py
├── File Operations
│   ├── read_file()  - Read with line numbers
│   ├── write_file() - Write content
│   └── edit_file()  - Find & replace
├── Search Operations
│   ├── glob_files() - Pattern matching
│   └── grep_files() - Regex search
├── Shell Operations
│   └── run_bash()   - Execute commands
├── DSPy Components
│   ├── CodingAssistant (Signature)
│   └── RLMCodingProgram (PrecompiledProgram)
└── Modaic Integration
    └── RLMCodingConfig (PrecompiledConfig)
```
### Key Classes
#### `RLMCodingConfig`
Configuration class extending `PrecompiledConfig` for experiment-specific parameters.
```python
class RLMCodingConfig(PrecompiledConfig):
    max_iters: int = 20
    lm: str = "openrouter/anthropic/claude-3.5-sonnet"
    sub_lm: str = "openrouter/openai/gpt-4.1"
    api_base: str = "https://openrouter.ai/api/v1"
    max_tokens: int = 32000
    max_output_chars: int = 100000
    verbose: bool = False
    track_usage: bool = True
```
#### `RLMCodingProgram`
Main program class extending `PrecompiledProgram`. Wraps a DSPy RLM agent with coding tools.
```python
class RLMCodingProgram(PrecompiledProgram):
    config: RLMCodingConfig

    def forward(self, task: str) -> dspy.Prediction:
        # Returns prediction with .answer and .affected_files
        return self.agent(task=task)
```
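
A minimal local-construction sketch, mirroring what `main()` does without the interactive loop (assumes `OPENROUTER_API_KEY` is set and you are in the repo root):

```python
from nanocode import RLMCodingConfig, RLMCodingProgram

# Build the agent directly; unset fields fall back to RLMCodingConfig defaults
config = RLMCodingConfig(max_iters=30, verbose=True)
agent = RLMCodingProgram(config)

result = agent(task="Summarize what nanocode.py does")
print(result.answer)
print(result.affected_files)
```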
#### `CodingAssistant`
DSPy Signature defining the agent's input/output schema.
```python
class CodingAssistant(dspy.Signature):
    task: str = dspy.InputField()
    answer: str = dspy.OutputField()
    affected_files: list[str] = dspy.OutputField()
```
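
For reference, `RLMCodingProgram.__init__` wires this signature into the RLM roughly as follows (condensed from `nanocode.py`; `config` is an `RLMCodingConfig` instance and the tool dict is abbreviated here):

```python
import dspy

# Sub-LM that serves the RLM's llm_query calls
sub_lm = dspy.LM(
    config.sub_lm,
    api_base=config.api_base,
    max_tokens=config.max_tokens,
    track_usage=config.track_usage,
)

# The RLM drives CodingAssistant and exposes the coding tools
agent = dspy.RLM(
    CodingAssistant,
    sub_lm=sub_lm,
    tools={"read_file": read_file, "run_bash": run_bash},  # full set omitted
    max_output_chars=config.max_output_chars,
    max_iterations=config.max_iters,
    verbose=config.verbose,
)
```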
---
## Publishing Your Own Version
If you modify nanocode and want to publish your own version to Modaic Hub:
```python
from nanocode import RLMCodingProgram, RLMCodingConfig
# Create and optionally optimize your program
program = RLMCodingProgram(RLMCodingConfig())
# Push to your Modaic Hub repo
program.push_to_hub(
    "your-username/my-nanocode",
    commit_message="My customized nanocode",
    with_code=True,  # Include source code for AutoProgram loading
)
```
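
Anyone can then load your fork back the same way the stock agent is loaded (sketch; the repo name matches the push above):

```python
from modaic import AutoProgram

# Load your published fork instead of the upstream repo
agent = AutoProgram.from_precompiled("your-username/my-nanocode")
result = agent(task="List the TODO comments in this repo")
print(result.answer)
```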
---
## Dependencies
- [DSPy](https://dspy.ai/) - Framework for programming language models
- [Modaic](https://modaic.dev/) - Hub for sharing and versioning DSPy programs
- OpenRouter API key (for accessing language models)
Install dependencies:
```bash
pip install dspy modaic
# or with uv
uv add dspy modaic
```
---
## Environment Variables
| Variable | Required | Description |
|----------|----------|-------------|
| `OPENROUTER_API_KEY` | Yes | Your OpenRouter API key |
| `MODEL` | No | Override the default model selection |
| `MODAIC_TOKEN` | For Hub | Required for pushing/loading from Modaic Hub |
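
A typical shell setup for local use plus Hub publishing (all values are placeholders):

```bash
export OPENROUTER_API_KEY="sk-or-..."    # required
export MODEL="openrouter/openai/gpt-4"   # optional model override
export MODAIC_TOKEN="..."                # only needed for Modaic Hub push/load
python nanocode.py
```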
---
## License
MIT


@@ -1,10 +1,11 @@
{
  "model": null,
  "max_iters": 20,
-  "lm": "openai/gpt-5.2-codex",
  "lm": "openrouter/anthropic/claude-3.5-sonnet",
  "sub_lm": "openrouter/openai/gpt-4.1",
  "api_base": "https://openrouter.ai/api/v1",
-  "max_tokens": 16000,
  "max_tokens": 32000,
  "max_output_chars": 100000,
-  "verbose": false
  "verbose": false,
  "track_usage": true
}

nanocode.py

@@ -234,16 +234,6 @@ class CodingAssistant(dspy.Signature):
)

-tools = {
-    "readfile": read_file,
-    "writefile": write_file,
-    "editfile": edit_file,
-    "globfiles": glob_files,
-    "grepfiles": grep_files,
-    "runbash": run_bash,
-}

class ToolLoggingCallback(BaseCallback):
    """Callback that logs tool calls as they happen."""
@@ -278,9 +268,10 @@ class RLMCodingConfig(PrecompiledConfig):
    lm: str = "openrouter/anthropic/claude-3.5-sonnet"  # Default fallback
    sub_lm: str = "openrouter/openai/gpt-4.1"  # Default fallback
    api_base: str = "https://openrouter.ai/api/v1"
-    max_tokens: int = 16000
    max_tokens: int = 32000
    max_output_chars: int = 100000
    verbose: bool = False
    track_usage: bool = True

class RLMCodingProgram(PrecompiledProgram):
@@ -289,6 +280,14 @@ class RLMCodingProgram(PrecompiledProgram):
    def __init__(self, config: RLMCodingConfig, **kwargs):
        self.config = config
        super().__init__(config, **kwargs)
        self.tools = {
            "read_file": read_file,
            "write_file": write_file,
            "edit_file": edit_file,
            "glob_files": glob_files,
            "grep_files": grep_files,
            "run_bash": run_bash,
        }
        # tool logging for introspection on multi-turn conversations
        dspy.settings.configure(callbacks=[ToolLoggingCallback()])
@@ -296,16 +295,18 @@ class RLMCodingProgram(PrecompiledProgram):
            self.config.lm,
            api_base=self.config.api_base,
            max_tokens=self.config.max_tokens,
            track_usage=self.config.track_usage,
        )
        sub_lm = dspy.LM(
            self.config.sub_lm,
            api_base=self.config.api_base,
            max_tokens=self.config.max_tokens,
            track_usage=self.config.track_usage,
        )
        agent = dspy.RLM(
            CodingAssistant,
            sub_lm=sub_lm,
-            tools=tools,
            tools=self.tools,
            max_output_chars=self.config.max_output_chars,
            max_iterations=self.config.max_iters,
            verbose=self.config.verbose,
@@ -318,6 +319,15 @@ class RLMCodingProgram(PrecompiledProgram):
        assert task, "Task cannot be empty"
        return self.agent(task=task)

    def get_tools(self):
        return self.tools

    def set_tool(self, name: str, tool: callable):
        self.tools[name] = tool

    def remove_tool(self, name: str):
        del self.tools[name]

def main():
    model = os.getenv("MODEL")
    if model is None:
@@ -332,7 +342,7 @@ def main():
    agent = RLMCodingProgram(config)
    print(
-        f"{BOLD}nanocode-dspy{RESET} | {DIM}{agent.config.lm} | {os.getcwd()}{RESET}\n"
        f"{BOLD}NANOCODE DSPY{RESET} | {DIM}{agent.config.lm} | {os.getcwd()}{RESET}\n"
    )

    # Conversation history for context
@@ -352,6 +362,37 @@ def main():
            history = []
            print(f"{GREEN}⏺ Cleared conversation{RESET}")
            continue
        if user_input == "/model":
            print(f"\n{BOLD}Current model: {agent.config.lm}{RESET}")
            print(f"\n{BOLD}Select a new model:{RESET}")
            for key, (name, model_id) in AVAILABLE_MODELS.items():
                print(f"  {BLUE}{key}{RESET}. {name} ({DIM}{model_id}{RESET})")
            print(f"  {BLUE}c{RESET}. Custom model (enter manually)")
            print(f"  {BLUE}k{RESET}. Keep current model")
            choice = input(f"\n{BOLD}{BLUE}❯{RESET} Enter choice: ").strip().lower()
            if choice == "k":
                print(f"{GREEN}⏺ Keeping current model: {agent.config.lm}{RESET}")
                continue
            elif choice in AVAILABLE_MODELS:
                name, model_id = AVAILABLE_MODELS[choice]
                new_model = model_id if model_id.startswith("openrouter/") else f"openrouter/{model_id}"
                config.lm = new_model
                agent = RLMCodingProgram(config)
                print(f"{GREEN}⏺ Switched to: {name} ({new_model}){RESET}")
            elif choice == "c":
                custom_model = input(f"{BOLD}{BLUE}❯{RESET} Enter model ID: ").strip()
                if custom_model:
                    new_model = custom_model if custom_model.startswith("openrouter/") else f"openrouter/{custom_model}"
                    config.lm = new_model
                    agent = RLMCodingProgram(config)
                    print(f"{GREEN}⏺ Switched to custom model: {new_model}{RESET}")
                else:
                    print(f"{RED}⏺ Invalid model ID, keeping current model{RESET}")
            else:
                print(f"{RED}⏺ Invalid choice, keeping current model{RESET}")
            continue
        # Build context from history
        context = f"Working directory: {os.getcwd()}\n"
@@ -364,11 +405,14 @@ def main():
        print(f"\n{CYAN}{RESET} Thinking...", flush=True)

-        # Run the ReAct agent
        # Run the RLM agent
        result = agent(task=task)

        # Display the answer
        print(f"\n{CYAN}{RESET} {render_markdown(result.answer)}")

        # Display usage
        print(f"\n{MAGENTA}⏺ Debug Prediction: {result}{RESET}")

        # Save to history
        history.append({"user": user_input, "assistant": result.answer})
@@ -385,6 +429,6 @@ def main():
if __name__ == "__main__":
-    agent = RLMCodingProgram(RLMCodingConfig(lm="openai/gpt-5.2-codex"))
    agent = RLMCodingProgram(RLMCodingConfig())
-    agent.push_to_hub(MODAIC_REPO_PATH, commit_message="Switch to RLM instead of ReAct", tag="v0.0.1")
    agent.push_to_hub(MODAIC_REPO_PATH, commit_message="Switch to RLM instead of ReAct", tag="v0.0.3")
    #main()


@@ -4,7 +4,7 @@
"train": [], "train": [],
"demos": [], "demos": [],
"signature": { "signature": {
"instructions": "You are a concise coding assistant. Help the user with their coding task by using the available tools to read, write, edit files, search the codebase, and run commands.\n\nYou are tasked with producing the following outputs given the inputs `task`:\n- {answer}\n- {affected_files} # note: the value you produce must adhere to the JSON schema: {\"type\": \"array\", \"items\": {\"type\": \"string\"}}\n\nYou have access to a Python REPL environment. Write Python code and it will be executed. You will see the output, then write more code based on what you learned. This is an iterative process.\n\nAvailable:\n- Variables: `task` (your input data)\n- `llm_query(prompt)` - query a sub-LLM (~500K char capacity) for semantic analysis\n- `llm_query_batched(prompts)` - query multiple prompts concurrently (much faster for multiple queries)\n- `print()` - ALWAYS print to see results\n- `SUBMIT(answer, affected_files)` - submit final output when done\n- Standard libraries: re, json, collections, math, etc.\n\nIMPORTANT: This is ITERATIVE. Each code block you write will execute, you'll see the output, then you decide what to do next. Do NOT try to solve everything in one step.\n\n1. EXPLORE FIRST - Look at your data before processing it. Print samples, check types/lengths, understand the structure.\n2. ITERATE - Write small code snippets, observe outputs, then decide next steps. State persists between iterations.\n3. VERIFY BEFORE SUBMITTING - If results seem wrong (zeros, empty, unexpected), reconsider your approach.\n4. USE llm_query FOR SEMANTICS - String matching finds WHERE things are; llm_query understands WHAT things mean.\n5. MINIMIZE RETYPING (INPUTS & OUTPUTS) - When values are long, precise, or error-prone (IDs, numbers, code, quotes), re-access them via variables and parse/compute in code instead of retyping. Use small, targeted prints to sanity-check, but avoid manual copying when variables can carry the exact value.\n6. SUBMIT ONLY AFTER SEEING OUTPUTS - SUBMIT ends the current run immediately. If you need to inspect printed output, run it in one step, review the result, then call SUBMIT in a later step.\n\nYou have max 50 sub-LLM calls. When done, call SUBMIT() with your output.\nAdditional tools available (use these instead of standard library equivalents):\n- `readfile(path: str, offset: int, limit: int) -> str` - Read file contents with line numbers.\n- `writefile(path: str, content: str) -> str` - Write content to a file.\n- `editfile(path: str, old: str, new: str, replace_all: bool) -> str` - Replace text in a file.\n- `globfiles(pattern: str, path: str) -> str` - Find files matching a glob pattern, sorted by modification time.\n- `grepfiles(pattern: str, path: str) -> str` - Search files for a regex pattern.\n- `runbash(cmd: str) -> str` - Run a shell command and return output.", "instructions": "You are a concise coding assistant. Help the user with their coding task by using the available tools to read, write, edit files, search the codebase, and run commands.\n\nYou are tasked with producing the following outputs given the inputs `task`:\n- {answer}\n- {affected_files} # note: the value you produce must adhere to the JSON schema: {\"type\": \"array\", \"items\": {\"type\": \"string\"}}\n\nYou have access to a Python REPL environment. Write Python code and it will be executed. You will see the output, then write more code based on what you learned. 
This is an iterative process.\n\nAvailable:\n- Variables: `task` (your input data)\n- `llm_query(prompt)` - query a sub-LLM (~500K char capacity) for semantic analysis\n- `llm_query_batched(prompts)` - query multiple prompts concurrently (much faster for multiple queries)\n- `print()` - ALWAYS print to see results\n- `SUBMIT(answer, affected_files)` - submit final output when done\n- Standard libraries: re, json, collections, math, etc.\n\nIMPORTANT: This is ITERATIVE. Each code block you write will execute, you'll see the output, then you decide what to do next. Do NOT try to solve everything in one step.\n\n1. EXPLORE FIRST - Look at your data before processing it. Print samples, check types/lengths, understand the structure.\n2. ITERATE - Write small code snippets, observe outputs, then decide next steps. State persists between iterations.\n3. VERIFY BEFORE SUBMITTING - If results seem wrong (zeros, empty, unexpected), reconsider your approach.\n4. USE llm_query FOR SEMANTICS - String matching finds WHERE things are; llm_query understands WHAT things mean.\n5. MINIMIZE RETYPING (INPUTS & OUTPUTS) - When values are long, precise, or error-prone (IDs, numbers, code, quotes), re-access them via variables and parse/compute in code instead of retyping. Use small, targeted prints to sanity-check, but avoid manual copying when variables can carry the exact value.\n6. SUBMIT ONLY AFTER SEEING OUTPUTS - SUBMIT ends the current run immediately. If you need to inspect printed output, run it in one step, review the result, then call SUBMIT in a later step.\n\nYou have max 50 sub-LLM calls. When done, call SUBMIT() with your output.\nAdditional tools available (use these instead of standard library equivalents):\n- `read_file(path: str, offset: int, limit: int) -> str` - Read file contents with line numbers.\n- `write_file(path: str, content: str) -> str` - Write content to a file.\n- `edit_file(path: str, old: str, new: str, replace_all: bool) -> str` - Replace text in a file.\n- `glob_files(pattern: str, path: str) -> str` - Find files matching a glob pattern, sorted by modification time.\n- `grep_files(pattern: str, path: str) -> str` - Search files for a regex pattern.\n- `run_bash(cmd: str) -> str` - Run a shell command and return output.",
"fields": [ "fields": [
{ {
"prefix": "Variables Info:", "prefix": "Variables Info:",
@@ -29,7 +29,7 @@
      ]
    },
    "lm": {
"model": "openai/gpt-5.2-codex", "model": "openrouter/anthropic/claude-3.5-sonnet",
"model_type": "chat", "model_type": "chat",
"cache": true, "cache": true,
"num_retries": 3, "num_retries": 3,
@@ -37,8 +37,9 @@
"launch_kwargs": {}, "launch_kwargs": {},
"train_kwargs": {}, "train_kwargs": {},
"temperature": null, "temperature": null,
"max_tokens": 16000, "max_tokens": 32000,
"api_base": "https://openrouter.ai/api/v1" "api_base": "https://openrouter.ai/api/v1",
"track_usage": true
} }
}, },
"agent.extract": { "agent.extract": {
@@ -67,7 +68,7 @@
      ]
    },
    "lm": {
-      "model": "openai/gpt-5.2-codex",
      "model": "openrouter/anthropic/claude-3.5-sonnet",
      "model_type": "chat",
      "cache": true,
      "num_retries": 3,
@@ -75,8 +76,9 @@
"launch_kwargs": {}, "launch_kwargs": {},
"train_kwargs": {}, "train_kwargs": {},
"temperature": null, "temperature": null,
"max_tokens": 16000, "max_tokens": 32000,
"api_base": "https://openrouter.ai/api/v1" "api_base": "https://openrouter.ai/api/v1",
"track_usage": true
} }
}, },
"metadata": { "metadata": {

pyproject.toml

@@ -4,4 +4,4 @@ version = "0.1.0"
description = "Add your description here"
readme = "README.md"
requires-python = ">=3.13"
-dependencies = ["dspy>=3.1.2", "modaic>=0.10.3"]
dependencies = ["dspy>=3.1.2", "modaic>=0.10.4"]