🐙TakoAPI
← 全部场景

🖼️创意与媒体 AI 智能体

面向「创意与媒体」的 AI 智能体,通过一个统一 API 调用。

30 个智能体

VAP Execution Agent

Execution Control Layer for AI Agents. VAP is where nondeterminism stops. If your agents call paid APIs directly, you don't have cost control. VAP enforces pre-commit pricing, hard budget guarantees, deterministic retry behavior, and explicit execution ownership.

🖼️创意与媒体
A2A
免费
5 个技能

MetaVision AI Studio

AI-powered creative platform. Generate 3D models from text or image, create music, video and images with AI. CNC G-code analysis. Built on Base network.

🖼️创意与媒体
A2A
免费
4 个技能

LocalAI

项目

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

🧰智能体框架🖼️创意与媒体
mudler/LocalAI自托管47.1k

ChatTTS

项目

A generative speech model for daily dialogue.

🖼️创意与媒体
2noise/ChatTTS自托管39.5k

mastra

项目

Mastra is the modern TypeScript framework for AI-powered applications and agents.

🧰智能体框架🎨前端
mastra-ai/mastra自托管25.5k

OpenMontage

项目

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

💻编程与开发🖼️创意与媒体
calesthio/OpenMontage自托管22.5k

leon

项目

🧠 Leon is your open-source personal assistant.

🖼️创意与媒体💬客户支持
leon-ai/leon自托管17.3k

waoowaoo

项目

首家工业级全流程 AI 影视生产平台。Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows.

🧰智能体框架🖼️创意与媒体
waooAI/waoowaoo自托管12.9k

agents

项目

A framework for building realtime voice AI agents 🤖🎙️📹

🧰智能体框架🖼️创意与媒体
livekit/agents自托管11.1k

voltagent

项目

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

🧰智能体框架☁️DevOps 与自动化
VoltAgent/voltagent自托管9.8k

Vision-Agents

项目

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

🧰智能体框架🖼️创意与媒体
GetStream/Vision-Agents自托管8k

big-AGI

项目

AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.

🖼️创意与媒体
enricoros/big-AGI自托管7k

ian-xiaohei-illustrations

项目

中文小黑怪诞正文配图生成 Skill | 16:9 白底手绘 | 少量红橙蓝批注 | Codex Skill

💻编程与开发🖼️创意与媒体
helloianneo/ian-xiaohei-illustrations自托管6.2k

tokscale

项目

🛰️ A CLI tool for tracking token usage from OpenCode, Claude Code, 🦞OpenClaw (Clawdbot/Moltbot), Pi, Codex, Gemini, Cursor, AmpCode, Factory Droid, Kimi, and more! • 🏅Global Leaderboard + 2D/3D Contributions Graph

💻编程与开发🧰智能体框架
junhoyeo/tokscale自托管3.7k

awesome-generative-ai

项目

A curated list of Generative AI tools, works, models, and references

🖼️创意与媒体
filipecalegario/awesome-generative-ai自托管3.5k

ArcReel

项目

AI Agent 驱动的开源视频生成工作台 — 小说→角色/场景/道具设计→剧本→分镜图→视频,跨镜头角色与场景一致 | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI

🧰智能体框架☁️DevOps 与自动化
ArcReel/ArcReel自托管2.9k

Android-MVVM-Architecture-Android-Voice-AI-SDK

项目

Voice AI SDK is a reusable Android library that gives any app a full voice-driven AI conversation pipeline in minutes. Voice Assistant + Android Voide AI + SDK + MVVM + Kotlin

🖼️创意与媒体
ahmedeltaher/Android-MVVM-Architecture-Android-Voice-AI-SDK自托管2.6k

py-gpt

项目

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants,and more. Linux, Windows, Mac

🔍研究与搜索🖼️创意与媒体
szczyglis-dev/py-gpt自托管1.8k

Code2Video

项目

[ICML 2026] Video generation via code

💻编程与开发🧰智能体框架
showlab/Code2Video自托管1.8k

ava-whatsapp-agent-course

项目

Meet Ava, the WhatsApp Agent

🧰智能体框架🖼️创意与媒体
neural-maze/ava-whatsapp-agent-course自托管1.7k

Director

项目

AI video agents framework for next-gen video interactions and workflows.

🧰智能体框架🖼️创意与媒体
video-db/Director自托管1.4k

langchain4j-aideepin

项目

基于AI的工作效率提升工具(聊天、绘画、知识库、工作流、 MCP服务市场、语音输入输出、长期记忆) | Ai-based productivity tools (Chat,Draw,RAG,Workflow,MCP marketplace, ASR,TTS, Long-term memory etc)

🖼️创意与媒体🔍研究与搜索
moyangzhan/langchain4j-aideepin自托管1.3k

ian-handdrawn-ppt

项目

中文手绘技术 PPT 整页图像生成 Skill | 21:9 封面 + 16:9 正文配图 | PNG 输出

🖼️创意与媒体📑演示与文档
helloianneo/ian-handdrawn-ppt自托管1k

open-webui-tools

项目

Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. With a suite of over 15 specialized tools, function pipelines, and filters, this project supports academic research, agentic autonomy, multimodal creativity, workflows, and more

🧰智能体框架🖼️创意与媒体
Haervwe/open-webui-tools自托管755

voice-ai

项目

Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.

🖼️创意与媒体🧰智能体框架
rapidaai/voice-ai自托管679

the-delegation

项目

A no-code 3D playground to explore, design, and interact with Agentic AI systems

🧰智能体框架🖼️创意与媒体
arturitu/the-delegation自托管434

ChatSim

项目

[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration

🖼️创意与媒体
yifanlu0227/ChatSim自托管429

ai-agent-tools

项目

A curated collection of AI tools, utilities, and resources for developers and creators

💻编程与开发🧰智能体框架
cporter202/ai-agent-tools自托管421

sapphire

项目

She's the AI agent you come home to.

🧰智能体框架☁️DevOps 与自动化
ddxfish/sapphire自托管264

Trend2Video-Pro

项目

Trend-to-Video Agent Framework for publish-ready short video packages

🧰智能体框架🖼️创意与媒体
2417467487-hub/Trend2Video-Pro自托管212