🐙TakoAPI
← All scenarios

🖼️Creative & Media AI agents

AI agents for Creative & Media, callable through one unified API.

30 agents

VAP Execution Agent

Execution Control Layer for AI Agents. VAP is where nondeterminism stops. If your agents call paid APIs directly, you don't have cost control. VAP enforces pre-commit pricing, hard budget guarantees, deterministic retry behavior, and explicit execution ownership.

🖼️Creative & Media
A2A
Free
5 skills

MetaVision AI Studio

AI-powered creative platform. Generate 3D models from text or image, create music, video and images with AI. CNC G-code analysis. Built on Base network.

🖼️Creative & Media
A2A
Free
4 skills

LocalAI

project

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

🧰Agent Frameworks🖼️Creative & Media
mudler/LocalAIself-host47.1k

ChatTTS

project

A generative speech model for daily dialogue.

🖼️Creative & Media
2noise/ChatTTSself-host39.5k

mastra

project

Mastra is the modern TypeScript framework for AI-powered applications and agents.

🧰Agent Frameworks🎨Frontend
mastra-ai/mastraself-host25.5k

OpenMontage

project

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

💻Coding & Dev🖼️Creative & Media
calesthio/OpenMontageself-host22.5k

leon

project

🧠 Leon is your open-source personal assistant.

🖼️Creative & Media💬Customer Support
leon-ai/leonself-host17.3k

waoowaoo

project

首家工业级全流程 AI 影视生产平台。Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows.

🧰Agent Frameworks🖼️Creative & Media
waooAI/waoowaooself-host12.9k

agents

project

A framework for building realtime voice AI agents 🤖🎙️📹

🧰Agent Frameworks🖼️Creative & Media
livekit/agentsself-host11.1k

voltagent

project

AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

🧰Agent Frameworks☁️DevOps & Automation
VoltAgent/voltagentself-host9.8k

Vision-Agents

project

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

🧰Agent Frameworks🖼️Creative & Media
GetStream/Vision-Agentsself-host8k

big-AGI

project

AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.

🖼️Creative & Media
enricoros/big-AGIself-host7k

ian-xiaohei-illustrations

project

中文小黑怪诞正文配图生成 Skill | 16:9 白底手绘 | 少量红橙蓝批注 | Codex Skill

💻Coding & Dev🖼️Creative & Media
helloianneo/ian-xiaohei-illustrationsself-host6.2k

tokscale

project

🛰️ A CLI tool for tracking token usage from OpenCode, Claude Code, 🦞OpenClaw (Clawdbot/Moltbot), Pi, Codex, Gemini, Cursor, AmpCode, Factory Droid, Kimi, and more! • 🏅Global Leaderboard + 2D/3D Contributions Graph

💻Coding & Dev🧰Agent Frameworks
junhoyeo/tokscaleself-host3.7k

awesome-generative-ai

project

A curated list of Generative AI tools, works, models, and references

🖼️Creative & Media
filipecalegario/awesome-generative-aiself-host3.5k

ArcReel

project

AI Agent 驱动的开源视频生成工作台 — 小说→角色/场景/道具设计→剧本→分镜图→视频,跨镜头角色与场景一致 | Open-source AI video workspace powered by AI Agents, Nano Banana 2 & Veo 3.1 / Grok / Seedance / OpenAI

🧰Agent Frameworks☁️DevOps & Automation
ArcReel/ArcReelself-host2.9k

Android-MVVM-Architecture-Android-Voice-AI-SDK

project

Voice AI SDK is a reusable Android library that gives any app a full voice-driven AI conversation pipeline in minutes. Voice Assistant + Android Voide AI + SDK + MVVM + Kotlin

🖼️Creative & Media
ahmedeltaher/Android-MVVM-Architecture-Android-Voice-AI-SDKself-host2.6k

py-gpt

project

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants,and more. Linux, Windows, Mac

🔍Research & Search🖼️Creative & Media
szczyglis-dev/py-gptself-host1.8k

Code2Video

project

[ICML 2026] Video generation via code

💻Coding & Dev🧰Agent Frameworks
showlab/Code2Videoself-host1.8k

ava-whatsapp-agent-course

project

Meet Ava, the WhatsApp Agent

🧰Agent Frameworks🖼️Creative & Media
neural-maze/ava-whatsapp-agent-courseself-host1.7k

Director

project

AI video agents framework for next-gen video interactions and workflows.

🧰Agent Frameworks🖼️Creative & Media
video-db/Directorself-host1.4k

langchain4j-aideepin

project

基于AI的工作效率提升工具(聊天、绘画、知识库、工作流、 MCP服务市场、语音输入输出、长期记忆) | Ai-based productivity tools (Chat,Draw,RAG,Workflow,MCP marketplace, ASR,TTS, Long-term memory etc)

🖼️Creative & Media🔍Research & Search
moyangzhan/langchain4j-aideepinself-host1.3k

ian-handdrawn-ppt

project

中文手绘技术 PPT 整页图像生成 Skill | 21:9 封面 + 16:9 正文配图 | PNG 输出

🖼️Creative & Media📑Presentations & Docs
helloianneo/ian-handdrawn-pptself-host1k

open-webui-tools

project

Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance, turning it into a powerful AI workstation. With a suite of over 15 specialized tools, function pipelines, and filters, this project supports academic research, agentic autonomy, multimodal creativity, workflows, and more

🧰Agent Frameworks🖼️Creative & Media
Haervwe/open-webui-toolsself-host755

voice-ai

project

Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.

🖼️Creative & Media🧰Agent Frameworks
rapidaai/voice-aiself-host679

the-delegation

project

A no-code 3D playground to explore, design, and interact with Agentic AI systems

🧰Agent Frameworks🖼️Creative & Media
arturitu/the-delegationself-host434

ChatSim

project

[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration

🖼️Creative & Media
yifanlu0227/ChatSimself-host429

ai-agent-tools

project

A curated collection of AI tools, utilities, and resources for developers and creators

💻Coding & Dev🧰Agent Frameworks
cporter202/ai-agent-toolsself-host421

sapphire

project

She's the AI agent you come home to.

🧰Agent Frameworks☁️DevOps & Automation
ddxfish/sapphireself-host264

Trend2Video-Pro

project

Trend-to-Video Agent Framework for publish-ready short video packages

🧰Agent Frameworks🖼️Creative & Media
2417467487-hub/Trend2Video-Proself-host212