Tag: advancement
-
Simon Willison’s Weblog: Qwen: Extending the Context Length to 1M Tokens
Source URL: https://simonwillison.net/2024/Nov/18/qwen-turbo/#atom-everything Source: Simon Willison’s Weblog Title: Qwen: Extending the Context Length to 1M Tokens Feedly Summary: Qwen: Extending the Context Length to 1M Tokens The new Qwen2.5-Turbo boasts a million token context window (up from 128,000 for Qwen 2.5) and faster performance: Using sparse attention mechanisms, we successfully reduced the time to first…
-
Simon Willison’s Weblog: Quoting Jack Clark
Source URL: https://simonwillison.net/2024/Nov/18/jack-clark/ Source: Simon Willison’s Weblog Title: Quoting Jack Clark Feedly Summary: The main innovation here is just using more data. Specifically, Qwen2.5 Coder is a continuation of an earlier Qwen 2.5 model. The original Qwen 2.5 model was trained on 18 trillion tokens spread across a variety of languages and tasks (e.g, writing,…
-
Wired: Why the US Government Banned Investments in Some Chinese AI Startups
Source URL: https://www.wired.com/story/treasury-outbound-investment-china-artificial-intelligence/ Source: Wired Title: Why the US Government Banned Investments in Some Chinese AI Startups Feedly Summary: The Biden administration chose to target only companies developing frontier AI models in China. But Trump could take a more sweeping approach. AI Summary and Description: Yes Summary: The recent restrictions imposed by the US Treasury…
-
Simon Willison’s Weblog: llm-gemini 0.4
Source URL: https://simonwillison.net/2024/Nov/18/llm-gemini-04/#atom-everything Source: Simon Willison’s Weblog Title: llm-gemini 0.4 Feedly Summary: llm-gemini 0.4 New release of my llm-gemini plugin, adding support for asynchronous models (see LLM 0.18), plus the new gemini-exp-1114 model (currently at the top of the Chatbot Arena) and a -o json_object 1 option to force JSON output. I also released llm-claude-3…
-
Simon Willison’s Weblog: LLM 0.18
Source URL: https://simonwillison.net/2024/Nov/17/llm-018/#atom-everything Source: Simon Willison’s Weblog Title: LLM 0.18 Feedly Summary: LLM 0.18 New release of LLM. The big new feature is asynchronous model support – you can now use supported models in async Python code like this: import llm model = llm.get_async_model(“gpt-4o") async for chunk in model.prompt( "Five surprising names for a pet…
-
The Register: Will passkeys ever replace passwords? Can they?
Source URL: https://www.theregister.com/2024/11/17/passkeys_passwords/ Source: The Register Title: Will passkeys ever replace passwords? Can they? Feedly Summary: Here’s why they really should Systems Approach I have been playing around with passkeys, or as they are formally known, discoverable credentials.… AI Summary and Description: Yes **Summary:** The text discusses the concept and implementation of passkeys, which are…
-
Hacker News: Memos – An open source Rewinds / Recall
Source URL: https://github.com/arkohut/memos Source: Hacker News Title: Memos – An open source Rewinds / Recall Feedly Summary: Comments AI Summary and Description: Yes Summary: The text describes “Memos,” a privacy-centric software tool designed for passive screen recording. Its primary focus is on user data control, ensuring all recording and processing occur locally, which aligns with…
-
Hacker News: SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks
Source URL: https://arxiv.org/abs/2310.03684 Source: Hacker News Title: SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks Feedly Summary: Comments AI Summary and Description: Yes Summary: This text presents “SmoothLLM,” an innovative algorithm designed to enhance the security of Large Language Models (LLMs) against jailbreaking attacks, which manipulate models into producing undesirable content. The proposal highlights a…