coasty-ai/open-computer-use

每日信息看板 · 2026-03-08
开源项目
Category
github_search
Source
0
Score
2026-03-08T01:46:52Z
Published

AI 总结

coasty-ai 发布开源项目 Open Computer Use,让 AI 代理像人一样操作浏览器、终端和桌面执行多步骤任务,重要性在于把“会聊天”的 AI 扩展为可实际完成工作流的可控自动化平台。
#GitHub #repo #开源项目 #Electron #FastAPI #Agent

内容摘录

<div align="center">

<img src="assets/coasty.png" alt="Coasty" width="120"/>

<br />
Open Computer Use

**AI agents that control computers like humans do.**

Browser automation · Terminal access · Desktop control · Multi-agent orchestration

<br />

Website · Discord · Twitter

<br />

License
PRs Welcome

</div>

<br />

---

<br />
See it in action

<table>
<tr>
<td align="center" width="50%">
<a href="https://www.youtube.com/watch?v=icxgLDephHE">
<img src="https://img.youtube.com/vi/icxgLDephHE/maxresdefault.jpg" alt="Marketing on Reddit" width="100%"/>
</a>
<br />
<strong>Marketing</strong> — Market your product on Reddit autonomously
<br />
<a href="https://coasty.ai/share/373c1f67-afec-4bd6-adda-3809ecdbdd75"><sub>View chat session</sub></a>
</td>
<td align="center" width="50%">
<a href="https://www.youtube.com/watch?v=qTvmGfg3HVw">
<img src="https://img.youtube.com/vi/qTvmGfg3HVw/maxresdefault.jpg" alt="Go-to-Market Outreach" width="100%"/>
</a>
<br />
<strong>Go-to-Market</strong> — Find prospects and send personalized emails
<br />
<a href="https://coasty.ai/share/425d3c49-3a06-41e5-9859-aa00c5b12f3d"><sub>View chat session</sub></a>
</td>
</tr>
<tr>
<td align="center">
<a href="https://www.youtube.com/watch?v=Wbo2o74hVIo">
<img src="https://img.youtube.com/vi/Wbo2o74hVIo/maxresdefault.jpg" alt="QA Testing" width="100%"/>
</a>
<br />
<strong>QA Testing</strong> — Test every checkout flow and report bugs
<br />
<a href="https://coasty.ai/share/7ee3e942-c5dd-4e49-93b6-353bb5273b7e"><sub>View chat session</sub></a>
</td>
<td align="center">
<a href="https://www.youtube.com/watch?v=mH-csaCa508">
<img src="https://img.youtube.com/vi/mH-csaCa508/maxresdefault.jpg" alt="Job Application" width="100%"/>
</a>
<br />
<strong>Job Application</strong> — Find roles, tailor your resume, and apply
<br />
<a href="https://coasty.ai/share/4ac6f3d2-c273-4a07-bf98-b986d1cbfb88"><sub>View chat session</sub></a>
</td>
</tr>
<tr>
<td align="center">
<a href="https://www.youtube.com/watch?v=AnHJuRMLCnE">
<img src="https://img.youtube.com/vi/AnHJuRMLCnE/maxresdefault.jpg" alt="Form Filling" width="100%"/>
</a>
<br />
<strong>Form Filling</strong> — Fill out the YC S26 application for you
<br />
<a href="https://coasty.ai/share/60a0722b-fb98-43d6-a4e7-951d80a22363"><sub>View chat session</sub></a>
</td>
<td align="center">
<a href="https://www.youtube.com/watch?v=A_OvNh51Npg">
<img src="https://img.youtube.com/vi/A_OvNh51Npg/maxresdefault.jpg" alt="Social Media" width="100%"/>
</a>
<br />
<strong>Social Media</strong> — Post on Hacker News and engage with comments
<br />
<a href="https://coasty.ai/share/d181de46-b41d-4b87-9648-0374b2b7ec1c"><sub>View chat session</sub></a>
</td>
</tr>
</table>

<br />

---

<br />
What is this?

Open Computer Use is an open-source platform that gives AI agents real computer control. Unlike chatbots that only *talk* about tasks, agents here **actually perform them** — browsing the web, running commands, clicking through UIs, and orchestrating multi-step workflows in isolated containers.
Computer use capabilities similar to Anthropic's Claude Computer Use, but fully open-source and extensible.

<br />

---

<br />
Agents

**Browser** — Search-first web navigation, form filling, element interaction, multi-tab management, screenshot capture.

**Terminal** — Command execution, file operations, script running, package management, output streaming.

**Desktop** — Mouse & keyboard control, window management, screenshot analysis, UI element detection via computer vision.

**Planner** — Decomposes complex requests into subtasks, assigns to specialized agents, passes context between steps.

<br />

---

<br />
Architecture

<br />

---

<br />
Quick Start
Prerequisites

Node.js 20+ · Python 3.10+ · Docker · Supabase account · AI provider API key
Clone & install
Configure environment

Set these in both .env files:
Set up database
Run

**Docker (recommended):**

**Manual:**

Open **http://localhost:3000**, sign in, start a chat, and give your agent a task.

<br />

---

<br />
Tech Stack

| Layer | Technologies |
| --- | --- |
| **Frontend** | Next.js 15, React 19, TypeScript, Tailwind CSS, Radix UI, Zustand, Vercel AI SDK |
| **Backend** | FastAPI, Python 3.10+, WebSockets, asyncio, uvicorn |
| **AI Providers** | OpenAI, Anthropic, Google, Azure, xAI, Mistral, Perplexity, OpenRouter |
| **Infrastructure** | Docker, Ubuntu 22.04 + XFCE, Chrome, Selenium, Supabase, Stripe |
| **Desktop App** | Electron 40, Puppeteer-core, platform-native automation (Win32 / CoreGraphics / xdotool) |

<br />

---

<br />
Electron Desktop App

A lightweight overlay that runs AI agent commands directly on your local machine instead of a remote VM.
Floating always-on-top pill UI with expanded chat panel
Platform-native automation (PowerShell/Win32 on Windows, CoreGraphics/osascript on macOS, xdotool on Linux)
Browser control via Puppeteer-core, shell execution, file operations
WebSocket bridge to backend with auto-reconnect

<br />

---

<br />
Project Structure

<br />

---

<br />
Contributing
Fork the repo
Create a branch: git checkout -b feature/your-feature
Commit your changes
Open a pull request

Bug reports and feature requests welcome in Issues.

<br />

---

<br />
Roadmap
[ ] Multi-VM parallel orchestration
[ ] Visual workflow builder
[ ] Agent marketplace & templates
[ ] Windows / macOS VM support
[ ] Plugin system for custom tools
[ ] Collaborative sessions
[ ] Voice control & video understanding

<br />

---

<br />
Responsible Use

This platform gives AI agents significant autonomy. Use it to automate repetitive tasks, testing, research, and content creation — not to violate terms of service, spam, or scrape without permission. Always use isolated environments, respect robots.txt, and follow data protection laws.

<br />

---

<br />
License

Apache License 2.0 — Copyright (c) 2025 Open Computer Use Contributors

<br />

---

<br />

<div align="center">

**Star on GitHub** · **Join Discor…