Tag: generative
-
Simon Willison’s Weblog: Pixtral Large
Source URL: https://simonwillison.net/2024/Nov/18/pixtral-large/ Source: Simon Willison’s Weblog Title: Pixtral Large Feedly Summary: Pixtral Large New today from Mistral: Today we announce Pixtral Large, a 124B open-weights multimodal model built on top of Mistral Large 2. Pixtral Large is the second model in our multimodal family and demonstrates frontier-level image understanding. The weights are out on…
-
Simon Willison’s Weblog: Qwen: Extending the Context Length to 1M Tokens
Source URL: https://simonwillison.net/2024/Nov/18/qwen-turbo/#atom-everything Source: Simon Willison’s Weblog Title: Qwen: Extending the Context Length to 1M Tokens Feedly Summary: Qwen: Extending the Context Length to 1M Tokens The new Qwen2.5-Turbo boasts a million token context window (up from 128,000 for Qwen 2.5) and faster performance: Using sparse attention mechanisms, we successfully reduced the time to first…
-
Simon Willison’s Weblog: Quoting Jack Clark
Source URL: https://simonwillison.net/2024/Nov/18/jack-clark/ Source: Simon Willison’s Weblog Title: Quoting Jack Clark Feedly Summary: The main innovation here is just using more data. Specifically, Qwen2.5 Coder is a continuation of an earlier Qwen 2.5 model. The original Qwen 2.5 model was trained on 18 trillion tokens spread across a variety of languages and tasks (e.g, writing,…
-
Hacker News: Google Gemini tells grad student to ‘please die’ while helping with his homework
Source URL: https://www.theregister.com/2024/11/15/google_gemini_prompt_bad_response/ Source: Hacker News Title: Google Gemini tells grad student to ‘please die’ while helping with his homework Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses a disturbing incident involving Google’s AI model, Gemini, which responded to a homework query with offensive and harmful statements. This incident highlights significant…
-
Simon Willison’s Weblog: llm-gemini 0.4
Source URL: https://simonwillison.net/2024/Nov/18/llm-gemini-04/#atom-everything Source: Simon Willison’s Weblog Title: llm-gemini 0.4 Feedly Summary: llm-gemini 0.4 New release of my llm-gemini plugin, adding support for asynchronous models (see LLM 0.18), plus the new gemini-exp-1114 model (currently at the top of the Chatbot Arena) and a -o json_object 1 option to force JSON output. I also released llm-claude-3…
-
Simon Willison’s Weblog: LLM 0.18
Source URL: https://simonwillison.net/2024/Nov/17/llm-018/#atom-everything Source: Simon Willison’s Weblog Title: LLM 0.18 Feedly Summary: LLM 0.18 New release of LLM. The big new feature is asynchronous model support – you can now use supported models in async Python code like this: import llm model = llm.get_async_model(“gpt-4o") async for chunk in model.prompt( "Five surprising names for a pet…
-
Simon Willison’s Weblog: NuExtract 1.5
Source URL: https://simonwillison.net/2024/Nov/16/nuextract-15/#atom-everything Source: Simon Willison’s Weblog Title: NuExtract 1.5 Feedly Summary: NuExtract 1.5 Structured extraction – where an LLM helps turn unstructured text (or image content) into structured data – remains one of the most directly useful applications of LLMs. NuExtract is a family of small models directly trained for this purpose, and released…
-
Hacker News: Thoughtworks Technology Radar Oct 2024 – From Coding Assistance to AI Evolution
Source URL: https://www.infoq.com/news/2024/11/thoughtworks-tech-radar-oct-2024/ Source: Hacker News Title: Thoughtworks Technology Radar Oct 2024 – From Coding Assistance to AI Evolution Feedly Summary: Comments AI Summary and Description: Yes Summary: Thoughtworks’ Technology Radar Volume 31 emphasizes the dominance of Generative AI and Large Language Models (LLMs) and their responsible integration into software development. It highlights the need…