Vision-module-auto/CHANGELOG.md
Svrnty 41cecca0e2 Initial release of Claude Vision Auto v1.0.0
Vision-based auto-approval system for Claude Code CLI using MiniCPM-V vision model.

Features:
- Automatic detection and response to approval prompts
- Screenshot capture and vision analysis via Ollama
- Support for multiple screenshot tools (scrot, gnome-screenshot, etc.)
- Configurable timing and behavior
- Debug mode for troubleshooting
- Comprehensive documentation

Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>
Co-Authored-By: Jean-Philippe Brule <jp@svrnty.io>
2025-10-29 10:09:01 -04:00

1.3 KiB

Changelog

All notable changes to Claude Vision Auto will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[1.0.0] - 2025-10-29

Added

  • Initial release of Claude Vision Auto
  • Vision-based auto-approval using MiniCPM-V
  • Support for multiple screenshot tools (scrot, gnome-screenshot, ImageMagick, maim)
  • Configurable timing and behavior via environment variables
  • Debug mode for troubleshooting
  • Comprehensive documentation (README, INSTALLATION, USAGE)
  • MIT License
  • Example usage scripts
  • Basic test suite

Features

  • Automatic detection of approval prompts
  • Screenshot capture of terminal window
  • Vision analysis via Ollama API
  • Intelligent response submission (1, y, WAIT)
  • Configurable idle threshold and response delay
  • Support for multiple vision models (MiniCPM-V, Llama 3.2 Vision, LLaVA)
  • Automatic screenshot cleanup
  • Connection testing and validation

Supported Platforms

  • Linux (Debian/Ubuntu tested)
  • X11 display server
  • Python 3.8+

[Unreleased]

Planned

  • Wayland support
  • macOS support
  • Headless mode (API-only)
  • Configurable response patterns
  • Multi-terminal support
  • Session recording and replay
  • Windows support (WSL)