donghaozhang/video-agent-skill

First Python Package for Video Cli Generation and Processing

39 stars17 forksUpdated Jan 23, 2026
npx skills add donghaozhang/video-agent-skill

README

AI Content Generation Suite

A comprehensive AI content generation package with multiple providers and services, consolidated into a single installable package.

Python 3.10+ License: MIT Code style: black PyPI

โšก Production-ready Python package with comprehensive CLI, parallel execution, and enterprise-grade architecture

๐ŸŽฌ Demo Video

AI Content Generation Suite Demo

Click to watch the complete demo of AI Content Generation Suite in action

๐ŸŽจ Available AI Models

40+ AI models across 8 categories - showing top picks below. See full models reference for complete list.

Text-to-Image (Top Picks)

ModelCostBest For
nano_banana_pro$0.002Fast & high-quality
gpt_image_1_5$0.003GPT-powered generation

Image-to-Video (Top Picks)

ModelCostBest For
sora_2$0.40-1.20OpenAI quality
kling_2_6_pro$0.50-1.00Professional quality

Text-to-Video (Top Picks)

ModelCostBest For
sora_2$0.40-1.20OpenAI quality
kling_2_6_pro$0.35-1.40Quality + audio

๐Ÿ’ก Cost-Saving Tip: Use --mock flag for FREE validation: ai-content-pipeline generate-image --text "test" --mock

๐Ÿ“š View all 40+ models โ†’

๐Ÿท๏ธ Latest Release

PyPI Version GitHub Release

What's New in v1.0.18

  • โœ… Automated PyPI publishing via GitHub Actions
  • ๐Ÿ”ง Consolidated setup files for cleaner package structure
  • ๐ŸŽฏ All 40+ AI models with comprehensive parallel processing support
  • ๐Ÿ“ฆ Improved CI/CD workflow with skip-existing option

๐Ÿš€ FLAGSHIP: AI Content Pipeline

The unified AI content generation pipeline with parallel execution support, multi-model integration, and YAML-based configuration.

Core Capabilities

  • ๐Ÿ”„ Unified Pipeline Architecture - YAML/JSON-based configuration for complex multi-step workflows
  • โšก Parallel Execution Engine - 2-3x performance improvement with thread-based parallel processing
  • ๐ŸŽฏ Type-Safe Configuration - Pydantic models with comprehensive validation
  • ๐Ÿ’ฐ Cost Management - Real-time cost estimation and tracking across all services
  • ๐Ÿ“Š Rich Logging - Beautiful console output with progress tracking and performance metrics

AI Service Integrations

  • ๐Ÿ–ผ๏ธ FAL AI - Text-to-image, image-to-image, text-to-video, video generation, avatar creation
  • ๐Ÿ—ฃ๏ธ ElevenLabs - Professional text-to-speech with 20+ voice options
  • ๐ŸŽฅ Google Vertex AI - Veo video generation and Gemini text generation
  • ๐Ÿ”— OpenRouter - Alternative TTS and chat completion services

Developer Experience

  • ๐Ÿ› ๏ธ Professional CLI - Comprehensive command-line interface with Click
  • ๐Ÿ“ฆ Modular Architecture - Clean separation of concerns with extensible design
  • ๐Ÿงช Comprehensive Testing - Unit and integration tests with pytest
  • ๐Ÿ“š Type Hints - Full type coverage for excellent IDE support

๐Ÿ“ฆ Installation

Quick Start

# Install from PyPI
pip install video-ai-studio

# Or install in development mode
pip install -e .

๐Ÿ”‘ API Keys Setup

After installation, you need to configure your API keys:

  1. Download the example configuration:

    # Option 1: Download from GitHub
    curl -o .env https://raw.githubusercontent.com/donghaozhang/video-agent-skill/main/.env.example
    
    # Option 2: Create manually
    touch .env
    
  2. Add your API keys to .env:

    # Required for most functionality
    FAL_KEY=your_fal_api_key_here
    
    # Optional - add as needed
    GEMINI_API_KEY=your_gemini_api_key_here
    OPENROUTER_API_KEY=your_openrouter_api_key_here
    ELEVENLABS_API_KEY=your_elevenlabs_api_key_here
    
  3. Get API keys from:

๐Ÿ“‹ Dependencies

The package installs core dependencies automatically. See requirements.txt for the complete list.

๐Ÿ› ๏ธ Quick Start

Console Commands

# List all available AI 

...
Read full README

Publisher

donghaozhangdonghaozhang

Statistics

Stars39
Forks17
Open Issues1
CreatedMay 22, 2025