Remote control any AI coding agent from your phone.
Your AI coding agents (Kiro, Claude Code, Gemini, Codex) live on your dev machine. RemoClaw makes them controllable from your phone over Telegram, with streaming responses, voice in/out, and decision forwarding.
Use it when:
- Reviewing a PR from the couch
- Triaging a bug on the bus
- Pair-programming hands-free while cooking
- Juggling multiple projects across separate threads
You send a message on Telegram, RemoClaw forwards it to the AI agent on your dev machine, and the agent's response streams back to your phone in real time. Voice messages, decision prompts (Y/n), and screenshots all flow through the same pipe.
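The forwarding step can be pictured with a minimal Python sketch. This is an illustration under stated assumptions, not RemoClaw's actual implementation: `forward_to_agent` and `on_chunk` are hypothetical names, and a real agent CLI may expect its prompt as an argument rather than on stdin.

```python
import subprocess
import sys
from typing import Callable, List


def forward_to_agent(cmd: List[str], prompt: str,
                     on_chunk: Callable[[str], None]) -> int:
    """Run a CLI command, send it the prompt on stdin, relay stdout line by line.

    Hypothetical helper: the stdin-based protocol is an assumption.
    """
    proc = subprocess.Popen(
        cmd,
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge stderr so errors also reach the chat
        text=True,
        bufsize=1,  # line-buffered on our side
    )
    assert proc.stdin is not None and proc.stdout is not None
    # Fine for short prompts; a very large prompt would need a writer thread
    # to avoid blocking while the child is also producing output.
    proc.stdin.write(prompt)
    proc.stdin.close()
    for line in proc.stdout:
        on_chunk(line)  # e.g. append to the chat message being edited
    return proc.wait()


if __name__ == "__main__":
    # Stand-in "agent": a Python one-liner that echoes stdin in upper case.
    fake_agent = [sys.executable, "-c",
                  "import sys; print(sys.stdin.read().upper())"]
    forward_to_agent(fake_agent, "hello from the phone",
                     lambda chunk: print(chunk, end=""))
```

In a real bot, `on_chunk` would be the place to batch lines and edit the outgoing chat message, rather than printing.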
For technical details, see the architecture doc.
git clone https://github.com/quangtam/remoclaw.git
cd remoclaw
bash setup.sh # Windows: setup.bat
./remoclaw start # Windows: remoclaw start

The setup wizard handles the Python venv, dependencies, Telegram token, and .env. After it finishes, log in to your CLI once (kiro-cli login / claude login / gemini auth / codex login) and start chatting.
Note: Bot commands are registered automatically on first start, so there is no need to configure them via BotFather.
Need manual setup or more control? See the deployment guide.
| Agent | Auth |
|---|---|
| Kiro CLI | kiro-cli login |
| Claude Code | claude login |
| Gemini CLI | gemini auth |
| OpenAI Codex | codex login |
All agents authenticate via browser login on the host machine. RemoClaw uses the local session, so no API keys are required for the agent itself.
- Multi-agent: pluggable drivers for Kiro, Claude Code, Gemini, Codex
- Real-time streaming: progressive message edits, ChatGPT-like UX
- Voice in/out: Whisper transcription + TTS readback, free local backends
- Multi-project: bind threads to different project directories, switch with /projects
- Decision forwarding: CLI Y/n prompts piped to chat, your reply piped back
- Screenshot auto-forward: image paths in CLI output sent as Telegram photos
- Concurrent sessions: multiple threads run in parallel, idle auto-cleanup
- Persistent config: thread bindings and preferences survive bot restarts
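Decision forwarding and screenshot auto-forward both depend on recognizing patterns in CLI output. The sketch below shows one plausible way to do that with regular expressions. The patterns and the function names `is_decision_prompt` and `find_image_paths` are assumptions made for illustration; each agent CLI words its prompts differently, and RemoClaw's own detection logic may differ.

```python
import re
from typing import List

# Assumed pattern: a line ending in "[Y/n]", "(y/n)", "[yes/no]" or similar.
DECISION_RE = re.compile(
    r"[\[(]\s*(?:y/n|yes/no)\s*[\])]\s*[:?]?\s*$", re.IGNORECASE
)

# Assumed pattern: a path-like token ending in a common image extension.
IMAGE_RE = re.compile(
    r"(?:[A-Za-z]:)?[\w./\\~-]+\.(?:png|jpe?g|gif|webp)\b", re.IGNORECASE
)


def is_decision_prompt(line: str) -> bool:
    """Return True if the line looks like a yes/no confirmation prompt."""
    return DECISION_RE.search(line.strip()) is not None


def find_image_paths(text: str) -> List[str]:
    """Return path-like tokens that end in an image extension."""
    return IMAGE_RE.findall(text)


if __name__ == "__main__":
    print(is_decision_prompt("Apply changes? [Y/n]"))
    print(find_image_paths("Saved screenshot to /tmp/shots/login.png"))
```

A bot using this approach would still need to check that a matched path exists on disk before trying to upload it.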
/help Full command reference
/info Current session details
/sessions List all active sessions
/project /path/to/repo Bind thread to a project
/provider claude Switch CLI provider
/voice Toggle voice output
/git status Run git status on the project
Full reference: docs/usage.md.
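As a rough picture of how a command such as /project /path/to/repo could be parsed and its thread binding kept across restarts, here is a small sketch. The JSON file layout and the names `parse_command`, `load_bindings`, and `bind_project` are hypothetical; the actual storage format is not described here.

```python
import json
from pathlib import Path
from typing import Dict, Tuple


def parse_command(text: str) -> Tuple[str, str]:
    """Split '/project /path/to/repo' into ('project', '/path/to/repo')."""
    if not text.startswith("/"):
        raise ValueError("not a command")
    name, _, arg = text[1:].partition(" ")
    # Telegram appends '@botname' to commands in group chats; drop it.
    return name.split("@", 1)[0].lower(), arg.strip()


def load_bindings(path: Path) -> Dict[str, str]:
    """Load thread-to-project bindings, or an empty mapping if none exist yet."""
    if not path.exists():
        return {}
    return json.loads(path.read_text(encoding="utf-8"))


def bind_project(path: Path, thread_id: int, project_dir: str) -> Dict[str, str]:
    """Record a binding and write it to disk so it survives a restart."""
    bindings = load_bindings(path)
    bindings[str(thread_id)] = project_dir  # JSON object keys must be strings
    path.write_text(json.dumps(bindings, indent=2), encoding="utf-8")
    return bindings
```

Keying by thread ID keeps each thread's project independent, which is what allows several threads to work on different repositories at once.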
| Topic | Doc |
|---|---|
| Usage & all commands | docs/usage.md |
| Voice configuration | docs/voice.md |
| Architecture | docs/architecture.md |
| Deployment (systemd, Docker) | docs/deployment-guide.md |
| Add a new CLI provider | docs/development-guide.md |
- Single bot instance per Telegram token (409 Conflict otherwise)
- Voice speed control is most precise on OpenAI TTS; edge-tts uses rate-percent mapping
- AI IDEs without a headless CLI (Cursor, Windsurf, VS Code) aren't supported yet; RemoClaw bridges to CLI agents
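To illustrate the rate-percent point: edge-tts takes its speaking rate as a signed percentage string such as "+50%", so a speed multiplier has to be converted and rounded to a whole percent. The linear mapping below is an assumption for illustration, and `speed_to_rate` is a hypothetical name; the mapping RemoClaw actually uses may differ.

```python
def speed_to_rate(speed: float) -> str:
    """Convert a speed multiplier (1.0 = normal) to a signed percent string.

    Assumed linear mapping: 1.5 -> "+50%", 0.8 -> "-20%".
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    percent = round((speed - 1.0) * 100)
    return f"{percent:+d}%"  # the sign is always included, e.g. "+0%"


if __name__ == "__main__":
    for s in (1.0, 1.25, 1.5, 0.8):
        print(s, speed_to_rate(s))
```

Because the result is rounded to a whole percent, fine-grained multipliers collapse to the nearest step, which is one reason percent-based control is coarser than a native speed parameter.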
MIT: free to use, modify, and distribute.
