AI Agent
The AI agent is the core of Aether. It runs inside each workspace and can write code, execute commands, navigate a browser, provision infrastructure, and manage databases, all from natural language prompts.
Model Configuration (BYOK)
Aether uses a Bring Your Own Key model. You connect your own AI provider credentials, and Aether routes requests accordingly.
Adding API Keys
- Web
- CLI
Go to Settings > API Keys and add your provider key. The key is encrypted and stored securely.
You need at least one API key configured before the agent can process tasks. The agent uses the key matching the provider configured for your account.
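The key-matching rule above can be sketched as a simple lookup. This is a minimal illustration under stated assumptions, not Aether's implementation: the provider names, the `stored_keys` mapping, `select_key`, and `MissingKeyError` are all hypothetical.

```python
from typing import Mapping


class MissingKeyError(RuntimeError):
    """Raised when no API key is stored for the configured provider."""


def select_key(keys: Mapping[str, str], provider: str) -> str:
    """Return the stored key for `provider`, failing before any task starts."""
    if not keys:
        raise MissingKeyError("no API keys configured; add at least one")
    try:
        return keys[provider]
    except KeyError:
        raise MissingKeyError(f"no API key stored for provider {provider!r}") from None


# Hypothetical stored keys (placeholders, not real credentials).
stored_keys = {"provider-a": "key-a-placeholder"}

print(select_key(stored_keys, "provider-a"))
```

Failing early with a clear error mirrors the requirement that at least one key exists before the agent can process tasks.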
Agent Tools
The agent has access to several categories of tools, each designed for a specific type of interaction.
Code Tools
Standard development operations provided through the SDK:

| Tool | Description |
|---|---|
| File read | Read file contents by path |
| File write | Create or overwrite files |
| File edit | Make targeted edits to existing files |
| Bash execution | Run shell commands in the workspace |
| Glob | Find files by pattern |
| Grep | Search file contents with regex |
| Web search | Search the web for documentation or references |
| Web fetch | Retrieve content from a URL |
| Todo tracking | Manage task checklists during work |
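To make the Glob and Grep rows concrete, here is a rough standard-library analogue of "find files by pattern, then search their contents with a regex". It is only an illustration of the idea; `glob_files`, `grep_files`, and `demo` are hypothetical helpers, and the agent's own tools may behave differently.

```python
import re
import tempfile
from pathlib import Path


def glob_files(root: Path, pattern: str) -> list[Path]:
    """Find files under `root` whose relative path matches a glob pattern."""
    return sorted(p for p in root.glob(pattern) if p.is_file())


def grep_files(paths: list[Path], regex: str) -> list[tuple[str, int, str]]:
    """Return (file name, line number, line) for every line matching `regex`."""
    compiled = re.compile(regex)
    hits = []
    for path in paths:
        for lineno, line in enumerate(path.read_text().splitlines(), start=1):
            if compiled.search(line):
                hits.append((path.name, lineno, line))
    return hits


def demo() -> list[tuple[str, int, str]]:
    # Build a throwaway project so the example is self-contained.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "src").mkdir()
        (root / "src" / "app.py").write_text("import os\n# TODO: add logging\n")
        (root / "README.md").write_text("TODO: write docs\n")
        python_files = glob_files(root, "**/*.py")
        return grep_files(python_files, r"TODO")


print(demo())  # [('app.py', 2, '# TODO: add logging')]
```

Narrowing by pattern first keeps the regex search limited to relevant files, which is why the README match is not reported.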
Browser Tools (Playwright)
The agent controls a headless browser for testing and interacting with web applications:

| Tool | Description |
|---|---|
| browser_navigate | Navigate to a URL |
| browser_screenshot | Capture a screenshot of the current page |
| browser_snapshot | Take an accessibility snapshot (structured DOM) |
| browser_click | Click an element on the page |
| browser_type | Type text into an input field |
| browser_fill | Fill a form field directly |
| browser_wait_for | Wait for an element, text, or condition |
| browser_get_text | Extract text content from an element |
| browser_evaluate | Execute JavaScript on the page |
| browser_press_key | Press a keyboard key |
| browser_go_back | Navigate back in browser history |
| browser_close | Close the browser |
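As a sketch of how these tools might be chained, the snippet below models a UI check as an ordered list of tool calls. The tool names come from the table; the `ToolCall` class, the argument names (`url`, `selector`, `value`, `text`), and the `http://localhost:3000/login` address are assumptions for illustration, not a documented call format.

```python
from dataclasses import dataclass, field
from typing import Any

# Tool names copied from the table above.
BROWSER_TOOLS = frozenset({
    "browser_navigate", "browser_screenshot", "browser_snapshot",
    "browser_click", "browser_type", "browser_fill", "browser_wait_for",
    "browser_get_text", "browser_evaluate", "browser_press_key",
    "browser_go_back", "browser_close",
})


@dataclass(frozen=True)
class ToolCall:
    """One browser tool invocation; argument names are hypothetical."""
    name: str
    args: dict[str, Any] = field(default_factory=dict)

    def __post_init__(self) -> None:
        # Reject names that are not in the documented tool list.
        if self.name not in BROWSER_TOOLS:
            raise ValueError(f"unknown browser tool: {self.name}")


# Hypothetical check of a login form on a local dev server.
login_check = [
    ToolCall("browser_navigate", {"url": "http://localhost:3000/login"}),
    ToolCall("browser_fill", {"selector": "#email", "value": "user@example.com"}),
    ToolCall("browser_click", {"selector": "button[type=submit]"}),
    ToolCall("browser_wait_for", {"text": "Welcome"}),
    ToolCall("browser_screenshot"),
    ToolCall("browser_close"),
]

for call in login_check:
    print(call.name, call.args)
```

Waiting for an expected element or text before taking the screenshot avoids capturing a page that is still loading.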
The browser runs inside the workspace VM, so it can access localhost URLs from your dev server. This is how the agent tests UI changes in real time.
How the Agent Runs
- You create a task with a prompt (via web chat or CLI)
- The workspace service starts the agent runtime
- The agent streams responses over WebSocket, including text, tool calls, and thinking steps
- Tool invocations execute inside the workspace (file operations, bash commands, browser actions)
- Conversation history is persisted to the database for context continuity
- The agent continues until the task is complete, it needs your input, or you abort
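The lifecycle above can be sketched as a loop that consumes streamed events until one of the three stop conditions is reached. This is a simplified model under stated assumptions: the event shapes (`type`, `content`, `tool`, `args`), the `run_task` function, and the `stream_closed` fallback are hypothetical, and an in-memory list stands in for both the WebSocket stream and the database.

```python
from typing import Callable, Iterable

# Hypothetical event types that end a run without a user abort.
TERMINAL = {"complete", "needs_input"}


def run_task(
    events: Iterable[dict],
    should_abort: Callable[[], bool] = lambda: False,
) -> tuple[str, list[dict]]:
    """Consume events until the task completes, needs input, or is aborted.

    Returns the stop reason and the history of events seen so far.
    """
    history: list[dict] = []
    for event in events:
        if should_abort():
            return "aborted", history
        history.append(event)  # stands in for persisting to the database
        if event["type"] in TERMINAL:
            return event["type"], history
    return "stream_closed", history


stream = [
    {"type": "thinking", "content": "Plan: read the file, then edit it."},
    {"type": "tool_call", "tool": "File read", "args": {"path": "app.py"}},
    {"type": "text", "content": "Updated app.py."},
    {"type": "complete"},
    {"type": "text", "content": "never reached"},
]

reason, history = run_task(stream)
print(reason, len(history))  # prints "complete 4"
```

Recording each event before checking for a terminal type keeps the stored history complete, so a later task can resume with full context.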