Tag: multi-modal input
-
Hacker News: gptel: a simple LLM client for Emacs
Source URL: https://github.com/karthink/gptel Source: Hacker News Title: gptel: a simple LLM client for Emacs Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text describes “gptel,” a client for interacting with Large Language Models (LLMs) in Emacs. It allows users to engage with different LLMs seamlessly within the Emacs environment, supporting features like contextual…
-
Simon Willison’s Weblog: You can now run prompts against images, audio and video in your terminal using LLM
Source URL: https://simonwillison.net/2024/Oct/29/llm-multi-modal/#atom-everything Source: Simon Willison’s Weblog Title: You can now run prompts against images, audio and video in your terminal using LLM Feedly Summary: I released LLM 0.17 last night, the latest version of my combined CLI tool and Python library for interacting with hundreds of different Large Language Models such as GPT-4o, Llama,…
-
The Register: Google expands visual, audio search, lets AI handle layout
Source URL: https://www.theregister.com/2024/10/03/google_expands_visual_audio_search/ Source: The Register Title: Google expands visual, audio search, lets AI handle layout Feedly Summary: AI Overviews get links to referenced websites – and ads Almost two decades ago, the head of Google’s then nascent enterprise division referred to the firm’s search service as “an uber-command line interface to the world."… AI…