Tag: advancement

  • Simon Willison’s Weblog: Qwen: Extending the Context Length to 1M Tokens

    Source URL: https://simonwillison.net/2024/Nov/18/qwen-turbo/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Qwen: Extending the Context Length to 1M Tokens
    Feedly Summary: The new Qwen2.5-Turbo boasts a million token context window (up from 128,000 for Qwen 2.5) and faster performance: Using sparse attention mechanisms, we successfully reduced the time to first…

  • Simon Willison’s Weblog: Quoting Jack Clark

    Source URL: https://simonwillison.net/2024/Nov/18/jack-clark/
    Source: Simon Willison’s Weblog
    Title: Quoting Jack Clark
    Feedly Summary: The main innovation here is just using more data. Specifically, Qwen2.5 Coder is a continuation of an earlier Qwen 2.5 model. The original Qwen 2.5 model was trained on 18 trillion tokens spread across a variety of languages and tasks (e.g., writing,…

  • Wired: Why the US Government Banned Investments in Some Chinese AI Startups

    Source URL: https://www.wired.com/story/treasury-outbound-investment-china-artificial-intelligence/
    Source: Wired
    Title: Why the US Government Banned Investments in Some Chinese AI Startups
    Feedly Summary: The Biden administration chose to target only companies developing frontier AI models in China. But Trump could take a more sweeping approach.
    AI Summary and Description: Yes
    Summary: The recent restrictions imposed by the US Treasury…

  • Simon Willison’s Weblog: llm-gemini 0.4

    Source URL: https://simonwillison.net/2024/Nov/18/llm-gemini-04/#atom-everything
    Source: Simon Willison’s Weblog
    Title: llm-gemini 0.4
    Feedly Summary: New release of my llm-gemini plugin, adding support for asynchronous models (see LLM 0.18), plus the new gemini-exp-1114 model (currently at the top of the Chatbot Arena) and a -o json_object 1 option to force JSON output. I also released llm-claude-3…

  • Hacker News: You could have designed state of the art positional encoding

    Source URL: https://fleetwood.dev/posts/you-could-have-designed-SOTA-positional-encoding
    Source: Hacker News
    Title: You could have designed state of the art positional encoding
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text discusses the evolution of positional encoding in transformer models, specifically focusing on Rotary Positional Encoding (RoPE) as utilized in modern language models like Llama 3.2. It explains…

  • Simon Willison’s Weblog: LLM 0.18

    Source URL: https://simonwillison.net/2024/Nov/17/llm-018/#atom-everything
    Source: Simon Willison’s Weblog
    Title: LLM 0.18
    Feedly Summary: New release of LLM. The big new feature is asynchronous model support – you can now use supported models in async Python code like this: import llm model = llm.get_async_model("gpt-4o") async for chunk in model.prompt("Five surprising names for a pet…

  • The Register: Will passkeys ever replace passwords? Can they?

    Source URL: https://www.theregister.com/2024/11/17/passkeys_passwords/
    Source: The Register
    Title: Will passkeys ever replace passwords? Can they?
    Feedly Summary: Here’s why they really should. Systems Approach: I have been playing around with passkeys, or as they are formally known, discoverable credentials.…
    AI Summary and Description: Yes
    Summary: The text discusses the concept and implementation of passkeys, which are…

  • Hacker News: Memos – An open source Rewinds / Recall

    Source URL: https://github.com/arkohut/memos
    Source: Hacker News
    Title: Memos – An open source Rewinds / Recall
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text describes “Memos,” a privacy-centric software tool designed for passive screen recording. Its primary focus is on user data control, ensuring all recording and processing occur locally, which aligns with…

  • Hacker News: All-in-one embedding model for interleaved text, images, and screenshots

    Source URL: https://blog.voyageai.com/2024/11/12/voyage-multimodal-3/
    Source: Hacker News
    Title: All-in-one embedding model for interleaved text, images, and screenshots
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The text announces the release of voyage-multimodal-3, a cutting-edge multimodal embedding model that enhances the capability of semantic search and retrieval tasks involving both text and images. Its ability to…

  • Hacker News: SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks

    Source URL: https://arxiv.org/abs/2310.03684
    Source: Hacker News
    Title: SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: This text presents “SmoothLLM,” an innovative algorithm designed to enhance the security of Large Language Models (LLMs) against jailbreaking attacks, which manipulate models into producing undesirable content. The proposal highlights a…