Tag: visual content

  • Simon Willison’s Weblog: Say hello to gemini-exp-1121

    Source URL: https://simonwillison.net/2024/Nov/22/gemini-exp-1121/#atom-everything Source: Simon Willison’s Weblog Title: Say hello to gemini-exp-1121 Feedly Summary: Say hello to gemini-exp-1121 Google Gemini’s Logan Kilpatrick on Twitter: Say hello to gemini-exp-1121! Our latest experimental gemini model, with: significant gains on coding performance stronger reasoning capabilities improved visual understanding Available on Google AI Studio and the Gemini API right…

  • Simon Willison’s Weblog: Ollama: Llama 3.2 Vision

    Source URL: https://simonwillison.net/2024/Nov/13/ollama-llama-vision/#atom-everything Source: Simon Willison’s Weblog Title: Ollama: Llama 3.2 Vision Feedly Summary: Ollama: Llama 3.2 Vision Ollama released version 0.4 last week with support for Meta’s first Llama vision model, Llama 3.2. If you have Ollama installed you can fetch the 11B model (7.9 GB) like this: ollama pull llama3.2-vision Or the larger…

  • Simon Willison’s Weblog: Claude API: PDF support (beta)

    Source URL: https://simonwillison.net/2024/Nov/1/claude-api-pdf-support-beta/#atom-everything Source: Simon Willison’s Weblog Title: Claude API: PDF support (beta) Feedly Summary: Claude API: PDF support (beta) Claude 3.5 Sonnet now accepts PDFs as attachments: The new Claude 3.5 Sonnet (claude-3-5-sonnet-20241022) model now supports PDF input and understands both text and visual content within documents. I just released llm-claude-3 0.7 with support…