Erstellt aus CDs Musik für Jellyfin
Find a file
dschlueter 1753ab204f Add Vision-LLM mode for direct image-to-JSON extraction
Tesseract OCR fails on rotated/low-contrast CD back covers.
New vision_llm module sends images directly to qwen3-vl via
Ollama chat API, bypassing OCR entirely. Robust JSON extraction
handles thinking tags, markdown blocks, and empty responses.
CLI scan/process commands gain --vision flag.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 01:35:05 +01:00
idea initial commit 2026-02-15 01:00:12 +01:00
src/musiksammlung Add Vision-LLM mode for direct image-to-JSON extraction 2026-02-15 01:35:05 +01:00
tests Add Vision-LLM mode for direct image-to-JSON extraction 2026-02-15 01:35:05 +01:00
.gitignore Add Vision-LLM mode for direct image-to-JSON extraction 2026-02-15 01:35:05 +01:00
LICENSE Initial commit 2026-02-15 00:53:30 +01:00
pyproject.toml Add project skeleton: CLI pipeline for CD digitization 2026-02-15 01:00:12 +01:00
README.md Initial commit 2026-02-15 00:53:30 +01:00

Musiksammlung

Erstellt aus CDs Musik für Jellyfin