内容摘录
<div align="center">
<img src="assets/coasty.png" alt="Coasty" width="120"/>
<br />
Open Computer Use
**AI agents that control computers like humans do.**
Browser automation · Terminal access · Desktop control · Multi-agent orchestration
<br />
Website · Discord · Twitter
<br />
License
PRs Welcome
</div>
<br />
---
<br />
See it in action
<table>
<tr>
<td align="center" width="50%">
<a href="https://www.youtube.com/watch?v=icxgLDephHE">
<img src="https://img.youtube.com/vi/icxgLDephHE/maxresdefault.jpg" alt="Marketing on Reddit" width="100%"/>
</a>
<br />
<strong>Marketing</strong> — Market your product on Reddit autonomously
<br />
<a href="https://coasty.ai/share/373c1f67-afec-4bd6-adda-3809ecdbdd75"><sub>View chat session</sub></a>
</td>
<td align="center" width="50%">
<a href="https://www.youtube.com/watch?v=qTvmGfg3HVw">
<img src="https://img.youtube.com/vi/qTvmGfg3HVw/maxresdefault.jpg" alt="Go-to-Market Outreach" width="100%"/>
</a>
<br />
<strong>Go-to-Market</strong> — Find prospects and send personalized emails
<br />
<a href="https://coasty.ai/share/425d3c49-3a06-41e5-9859-aa00c5b12f3d"><sub>View chat session</sub></a>
</td>
</tr>
<tr>
<td align="center">
<a href="https://www.youtube.com/watch?v=Wbo2o74hVIo">
<img src="https://img.youtube.com/vi/Wbo2o74hVIo/maxresdefault.jpg" alt="QA Testing" width="100%"/>
</a>
<br />
<strong>QA Testing</strong> — Test every checkout flow and report bugs
<br />
<a href="https://coasty.ai/share/7ee3e942-c5dd-4e49-93b6-353bb5273b7e"><sub>View chat session</sub></a>
</td>
<td align="center">
<a href="https://www.youtube.com/watch?v=mH-csaCa508">
<img src="https://img.youtube.com/vi/mH-csaCa508/maxresdefault.jpg" alt="Job Application" width="100%"/>
</a>
<br />
<strong>Job Application</strong> — Find roles, tailor your resume, and apply
<br />
<a href="https://coasty.ai/share/4ac6f3d2-c273-4a07-bf98-b986d1cbfb88"><sub>View chat session</sub></a>
</td>
</tr>
<tr>
<td align="center">
<a href="https://www.youtube.com/watch?v=AnHJuRMLCnE">
<img src="https://img.youtube.com/vi/AnHJuRMLCnE/maxresdefault.jpg" alt="Form Filling" width="100%"/>
</a>
<br />
<strong>Form Filling</strong> — Fill out the YC S26 application for you
<br />
<a href="https://coasty.ai/share/60a0722b-fb98-43d6-a4e7-951d80a22363"><sub>View chat session</sub></a>
</td>
<td align="center">
<a href="https://www.youtube.com/watch?v=A_OvNh51Npg">
<img src="https://img.youtube.com/vi/A_OvNh51Npg/maxresdefault.jpg" alt="Social Media" width="100%"/>
</a>
<br />
<strong>Social Media</strong> — Post on Hacker News and engage with comments
<br />
<a href="https://coasty.ai/share/d181de46-b41d-4b87-9648-0374b2b7ec1c"><sub>View chat session</sub></a>
</td>
</tr>
</table>
<br />
---
<br />
What is this?
Open Computer Use is an open-source platform that gives AI agents real computer control. Unlike chatbots that only *talk* about tasks, agents here **actually perform them** — browsing the web, running commands, clicking through UIs, and orchestrating multi-step workflows in isolated containers.
Computer use capabilities similar to Anthropic's Claude Computer Use, but fully open-source and extensible.
<br />
---
<br />
Agents
**Browser** — Search-first web navigation, form filling, element interaction, multi-tab management, screenshot capture.
**Terminal** — Command execution, file operations, script running, package management, output streaming.
**Desktop** — Mouse & keyboard control, window management, screenshot analysis, UI element detection via computer vision.
**Planner** — Decomposes complex requests into subtasks, assigns to specialized agents, passes context between steps.
<br />
---
<br />
Architecture
<br />
---
<br />
Quick Start
Prerequisites
Node.js 20+ · Python 3.10+ · Docker · Supabase account · AI provider API key
Clone & install
Configure environment
Set these in both .env files:
Set up database
Run
**Docker (recommended):**
**Manual:**
Open **http://localhost:3000**, sign in, start a chat, and give your agent a task.
<br />
---
<br />
Tech Stack
| Layer | Technologies |
| --- | --- |
| **Frontend** | Next.js 15, React 19, TypeScript, Tailwind CSS, Radix UI, Zustand, Vercel AI SDK |
| **Backend** | FastAPI, Python 3.10+, WebSockets, asyncio, uvicorn |
| **AI Providers** | OpenAI, Anthropic, Google, Azure, xAI, Mistral, Perplexity, OpenRouter |
| **Infrastructure** | Docker, Ubuntu 22.04 + XFCE, Chrome, Selenium, Supabase, Stripe |
| **Desktop App** | Electron 40, Puppeteer-core, platform-native automation (Win32 / CoreGraphics / xdotool) |
<br />
---
<br />
Electron Desktop App
A lightweight overlay that runs AI agent commands directly on your local machine instead of a remote VM.
Floating always-on-top pill UI with expanded chat panel
Platform-native automation (PowerShell/Win32 on Windows, CoreGraphics/osascript on macOS, xdotool on Linux)
Browser control via Puppeteer-core, shell execution, file operations
WebSocket bridge to backend with auto-reconnect
<br />
---
<br />
Project Structure
<br />
---
<br />
Contributing
Fork the repo
Create a branch: git checkout -b feature/your-feature
Commit your changes
Open a pull request
Bug reports and feature requests welcome in Issues.
<br />
---
<br />
Roadmap
[ ] Multi-VM parallel orchestration
[ ] Visual workflow builder
[ ] Agent marketplace & templates
[ ] Windows / macOS VM support
[ ] Plugin system for custom tools
[ ] Collaborative sessions
[ ] Voice control & video understanding
<br />
---
<br />
Responsible Use
This platform gives AI agents significant autonomy. Use it to automate repetitive tasks, testing, research, and content creation — not to violate terms of service, spam, or scrape without permission. Always use isolated environments, respect robots.txt, and follow data protection laws.
<br />
---
<br />
License
Apache License 2.0 — Copyright (c) 2025 Open Computer Use Contributors
<br />
---
<br />
<div align="center">
**Star on GitHub** · **Join Discor…