
The AI reads your screenshot as a pixel blob and guesses which button you meant. SlimSnap converts the screenshot plus your annotation into structured JSON: every element has coordinates, an ID, and your arrow points at a specific one. Around 700 tokens vs 1,568 raw on Sonnet. Free Mac app. Schema and Claude Code skill are open MIT. Runs entirely on-device.
SlimSnap is an AI tool that analyzes screenshots to identify UI elements and converts them into structured JSON format, including coordinates and IDs. The application is available for free on Mac and operates entirely on-device, with open-source schema and Claude Code skills under MIT licensing.