Tag: image segmentation
-
Simon Willison’s Weblog: Gemini 2.5 Computer Use can solve Google’s own CAPTCHAs
Source URL: https://simonwillison.net/2025/Oct/7/gemini-25-computer-use-captchas/ Source: Simon Willison’s Weblog Title: Gemini 2.5 Computer Use can solve Google’s own CAPTCHAs Feedly Summary: Google just introduced a new Gemini 2.5 Computer Use model, specially designed to help operate a GUI interface by interacting with visible elements using a virtual mouse and keyboard. I just tried their demo… and watched…
-
Simon Willison’s Weblog: AI assisted search-based research actually works now
Source URL: https://simonwillison.net/2025/Apr/21/ai-assisted-search/#atom-everything Source: Simon Willison’s Weblog Title: AI assisted search-based research actually works now Feedly Summary: For the past two and a half years the feature I’ve most wanted from LLMs is the ability to take on search-based research tasks on my behalf. We saw the first glimpses of this back in early 2023,…
-
Simon Willison’s Weblog: Image segmentation using Gemini 2.5
Source URL: https://simonwillison.net/2025/Apr/18/gemini-image-segmentation/ Source: Simon Willison’s Weblog Title: Image segmentation using Gemini 2.5 Feedly Summary: Max Woolf pointed out this new feature of the Gemini 2.5 series in a comment on Hacker News: One hidden note from Gemini 2.5 Flash when diving deep into the documentation: for image inputs, not only can the model be…