🖼️ Creative & Media AI Agents
AI agents for "Creative & Media", callable through a single unified API.
30 agents
VAP Execution Agent
Execution Control Layer for AI Agents. VAP is where nondeterminism stops. If your agents call paid APIs directly, you don't have cost control. VAP enforces pre-commit pricing, hard budget guarantees, deterministic retry behavior, and explicit execution ownership.
MetaVision AI Studio
AI-powered creative platform. Generate 3D models from text or image, create music, video and images with AI. CNC G-code analysis. Built on Base network.
LocalAI
Project: LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
ChatTTS
Project: A generative speech model for daily dialogue.
mastra
Project: Mastra is the modern TypeScript framework for AI-powered applications and agents.
OpenMontage
Project: World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.
leon
Project: 🧠 Leon is your open-source personal assistant.
waoowaoo
Project: The first industrial-grade, end-to-end AI film and video production platform. A professional AI Agent platform for controllable film & video production, from shorts to live-action, with Hollywood-standard workflows.
agents
Project: A framework for building realtime voice AI agents 🤖🎙️📹
voltagent
Project: AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework
Vision-Agents
Project: Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.
big-AGI
Project: AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.
ian-xiaohei-illustrations
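Project: Skill for generating quirky "Xiaohei" (little black character) illustrations for Chinese body text | 16:9, hand-drawn on a white background | sparse red, orange, and blue annotations | Codex Skill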
tokscale
Project: 🛰️ A CLI tool for tracking token usage from OpenCode, Claude Code, 🦞OpenClaw (Clawdbot/Moltbot), Pi, Codex, Gemini, Cursor, AmpCode, Factory Droid, Kimi, and more! • 🏅 Global Leaderboard + 2D/3D Contributions Graph
awesome-generative-ai
Project: A curated list of Generative AI tools, works, models, and references
ArcReel
Project: Open-source AI video workspace powered by AI Agents: novel → character/scene/prop design → script → storyboard → video, with characters and scenes kept consistent across shots | Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI
Android-MVVM-Architecture-Android-Voice-AI-SDK
Project: Voice AI SDK is a reusable Android library that gives any app a full voice-driven AI conversation pipeline in minutes. Voice Assistant + Android Voice AI + SDK + MVVM + Kotlin
py-gpt
Project: Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants, and more. Linux, Windows, Mac
Code2Video
Project: [ICML 2026] Video generation via code
ava-whatsapp-agent-course
Project: Meet Ava, the WhatsApp Agent
Director
Project: AI video agents framework for next-gen video interactions and workflows.
langchain4j-aideepin
Project: AI-based productivity tools (chat, drawing, knowledge base/RAG, workflow, MCP marketplace, ASR, TTS, long-term memory, etc.)
ian-handdrawn-ppt
Project: Skill for generating full-page images for Chinese hand-drawn technical slide decks (PPT) | 21:9 cover + 16:9 body illustrations | PNG output
open-webui-tools
Project: Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. With a suite of over 15 specialized tools, function pipelines, and filters, this project supports academic research, agentic autonomy, multimodal creativity, workflows, and more.
voice-ai
Project: Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
the-delegation
Project: A no-code 3D playground to explore, design, and interact with Agentic AI systems
ChatSim
Project: [CVPR 2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
ai-agent-tools
Project: A curated collection of AI tools, utilities, and resources for developers and creators
sapphire
Project: She's the AI agent you come home to.
Trend2Video-Pro
Project: Trend-to-Video Agent Framework for publish-ready short video packages