Vision-based auto-approval system for Claude Code CLI using MiniCPM-V vision model. Features: - Automatic detection and response to approval prompts - Screenshot capture and vision analysis via Ollama - Support for multiple screenshot tools (scrot, gnome-screenshot, etc.) - Configurable timing and behavior - Debug mode for troubleshooting - Comprehensive documentation Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com> Co-Authored-By: Jean-Philippe Brule <jp@svrnty.io>
1.3 KiB
1.3 KiB
Changelog
All notable changes to Claude Vision Auto will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
[1.0.0] - 2025-10-29
Added
- Initial release of Claude Vision Auto
- Vision-based auto-approval using MiniCPM-V
- Support for multiple screenshot tools (scrot, gnome-screenshot, ImageMagick, maim)
- Configurable timing and behavior via environment variables
- Debug mode for troubleshooting
- Comprehensive documentation (README, INSTALLATION, USAGE)
- MIT License
- Example usage scripts
- Basic test suite
Features
- Automatic detection of approval prompts
- Screenshot capture of terminal window
- Vision analysis via Ollama API
- Intelligent response submission (1, y, WAIT)
- Configurable idle threshold and response delay
- Support for multiple vision models (MiniCPM-V, Llama 3.2 Vision, LLaVA)
- Automatic screenshot cleanup
- Connection testing and validation
Supported Platforms
- Linux (Debian/Ubuntu tested)
- X11 display server
- Python 3.8+
[Unreleased]
Planned
- Wayland support
- macOS support
- Headless mode (API-only)
- Configurable response patterns
- Multi-terminal support
- Session recording and replay
- Windows support (WSL)