pipecat-ai/pipecat

每日信息看板 · 2026-03-10

返回当天 Daily Index

开源项目

AI 总结

Pipecat 是一个用于构建实时语音与多模态对话 AI 代理的开源 Python 框架，提供低延迟可组合管线与多服务集成，帮助开发者快速搭建并部署语音/视频交互应用。

面向实时语音与多模态代理：编排音频、视频、AI 服务、传输与对话管线
主打 voice-first、可插拔服务与模块化可组合 pipelines，强调超低延迟
生态包含多端 Client SDK（JS/React/React Native/Swift/Kotlin/C++/ESP32）及结构化会话 Flows
提供 CLI 用于项目创建、监控与部署，并配套调试器 Whisker 与终端看板 Tail
集成广泛服务：STT/LLM/TTS/S2S、WebRTC/WebSocket 传输、视频与观测指标等
最低 Python 3.10（推荐 3.12），支持通过 extras 安装第三方服务依赖

#GitHub #repo #开源项目 #Pipecat #WebRTC #Agent

原链接

内容摘录

<h1><div align="center">
 <img alt="pipecat" width="300px" height="auto" src="https://raw.githubusercontent.com/pipecat-ai/pipecat/main/pipecat.png">
</div></h1>

PyPI !Tests codecov Docs Discord Ask DeepWiki
🎙️ Pipecat: Real-Time Voice & Multimodal AI Agents

**Pipecat** is an open-source Python framework for building real-time voice and multimodal conversational agents. Orchestrate audio and video, AI services, different transports, and conversation pipelines effortlessly—so you can focus on what makes your agent unique.
Want to dive right in? Try the quickstart.
🚀 What You Can Build
**Voice Assistants** – natural, streaming conversations with AI
**AI Companions** – coaches, meeting assistants, characters
**Multimodal Interfaces** – voice, video, images, and more
**Interactive Storytelling** – creative tools with generative media
**Business Agents** – customer intake, support bots, guided flows
**Complex Dialog Systems** – design logic with structured conversations
🧠 Why Pipecat?
**Voice-first**: Integrates speech recognition, text-to-speech, and conversation handling
**Pluggable**: Supports many AI services and tools
**Composable Pipelines**: Build complex behavior from modular components
**Real-Time**: Ultra-low latency interaction with different transports (e.g. WebSockets or WebRTC)
🌐 Pipecat Ecosystem
📱 Client SDKs

Building client applications? You can connect to Pipecat from any platform using our official SDKs:

<a href="https://docs.pipecat.ai/client/js/introduction">JavaScript</a> | <a href="https://docs.pipecat.ai/client/react/introduction">React</a> | <a href="https://docs.pipecat.ai/client/react-native/introduction">React Native</a> |
<a href="https://docs.pipecat.ai/client/ios/introduction">Swift</a> | <a href="https://docs.pipecat.ai/client/android/introduction">Kotlin</a> | <a href="https://docs.pipecat.ai/client/c++/introduction">C++</a> | <a href="https://github.com/pipecat-ai/pipecat-esp32">ESP32</a>
🧭 Structured conversations

Looking to build structured conversations? Check out Pipecat Flows for managing complex conversational states and transitions.
🪄 Beautiful UIs

Want to build beautiful and engaging experiences? Checkout the Voice UI Kit, a collection of components, hooks and templates for building voice AI applications quickly.
🛠️ Create and deploy projects

Create a new project in under a minute with the Pipecat CLI. Then use the CLI to monitor and deploy your agent to production.
🔍 Debugging

Looking for help debugging your pipeline and processors? Check out Whisker, a real-time Pipecat debugger.
🖥️ Terminal

Love terminal applications? Check out Tail, a terminal dashboard for Pipecat.
🤖 Claude Code Skills

Use Pipecat Skills with Claude Code to scaffold projects, deploy to Pipecat Cloud, and more. Install the marketplace with:

and install any of the available plugins.
📺️ Pipecat TV Channel

Catch new features, interviews, and how-tos on our Pipecat TV channel.
🎬 See it in action

<p float="left">
 <a href="https://github.com/pipecat-ai/pipecat-examples/tree/main/simple-chatbot"><img src="https://raw.githubusercontent.com/pipecat-ai/pipecat-examples/main/simple-chatbot/image.png" width="400" /></a>&nbsp;
 <a href="https://github.com/pipecat-ai/pipecat-examples/tree/main/storytelling-chatbot"><img src="https://raw.githubusercontent.com/pipecat-ai/pipecat-examples/main/storytelling-chatbot/image.png" width="400" /></a>
 <br/>
 <a href="https://github.com/pipecat-ai/pipecat-examples/tree/main/translation-chatbot"><img src="https://raw.githubusercontent.com/pipecat-ai/pipecat-examples/main/translation-chatbot/image.png" width="400" /></a>&nbsp;
 <a href="https://github.com/pipecat-ai/pipecat/blob/main/examples/foundational/12-describe-video.py"><img src="https://github.com/pipecat-ai/pipecat/blob/main/examples/foundational/assets/moondream.png" width="400" /></a>
</p>
🧩 Available services

| Category | Services |
| ------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| Speech-to-Text | AssemblyAI, AWS, Azure, Cartesia, Deepgram, ElevenLabs, Fal Wizper, Gladia, Google, Gradium, Groq (Whisper), Hathora, NVIDIA Riva, OpenAI (Whisper), SambaNova (Whisper), Sarvam, Soniox, Speechmatics, Whisper |
| LLMs | Anthropic, AWS, Azure, Cerebras, DeepSeek, Fireworks AI, Gemini, Grok, Groq, Mistral, NVIDIA NIM, Ollama, OpenAI, OpenRouter, Perplexity, Qwen, SambaNova Together AI |
| Text-to-Speech | Async, AWS, Azure, Camb AI, Cartesia, Deepgram, ElevenLabs, Fish, Google, Gradium, Groq, Hathora, Hume, Inworld, LMNT, MiniMax, Neuphonic, NVIDIA Riva, OpenAI, Piper, PlayHT, Resemble, Rime, Sarvam, Spe…