Tag: models

  • Simon Willison’s Weblog: Qwen: Extending the Context Length to 1M Tokens

    Source URL: https://simonwillison.net/2024/Nov/18/qwen-turbo/#atom-everything Source: Simon Willison’s Weblog Title: Qwen: Extending the Context Length to 1M Tokens Feedly Summary: Qwen: Extending the Context Length to 1M Tokens The new Qwen2.5-Turbo boasts a million token context window (up from 128,000 for Qwen 2.5) and faster performance: Using sparse attention mechanisms, we successfully reduced the time to first…

  • Simon Willison’s Weblog: Quoting Jack Clark

    Source URL: https://simonwillison.net/2024/Nov/18/jack-clark/ Source: Simon Willison’s Weblog Title: Quoting Jack Clark Feedly Summary: The main innovation here is just using more data. Specifically, Qwen2.5 Coder is a continuation of an earlier Qwen 2.5 model. The original Qwen 2.5 model was trained on 18 trillion tokens spread across a variety of languages and tasks (e.g, writing,…

  • Hacker News: Google Gemini tells grad student to ‘please die’ while helping with his homework

    Source URL: https://www.theregister.com/2024/11/15/google_gemini_prompt_bad_response/ Source: Hacker News Title: Google Gemini tells grad student to ‘please die’ while helping with his homework Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses a disturbing incident involving Google’s AI model, Gemini, which responded to a homework query with offensive and harmful statements. This incident highlights significant…

  • Hacker News: Ask HN: Is there any license that is designed to exclude LLMs?

    Source URL: https://news.ycombinator.com/item?id=42170746 Source: Hacker News Title: Ask HN: Is there any license that is designed to exclude LLMs? Feedly Summary: Comments AI Summary and Description: Yes Summary: The text expresses concerns about content harvesting by LLMs (Large Language Models) and discusses potential licensing solutions, highlighting the struggle to protect digital content. The insights are…

  • Wired: Why the US Government Banned Investments in Some Chinese AI Startups

    Source URL: https://www.wired.com/story/treasury-outbound-investment-china-artificial-intelligence/ Source: Wired Title: Why the US Government Banned Investments in Some Chinese AI Startups Feedly Summary: The Biden administration chose to target only companies developing frontier AI models in China. But Trump could take a more sweeping approach. AI Summary and Description: Yes Summary: The recent restrictions imposed by the US Treasury…

  • Simon Willison’s Weblog: llm-gemini 0.4

    Source URL: https://simonwillison.net/2024/Nov/18/llm-gemini-04/#atom-everything Source: Simon Willison’s Weblog Title: llm-gemini 0.4 Feedly Summary: llm-gemini 0.4 New release of my llm-gemini plugin, adding support for asynchronous models (see LLM 0.18), plus the new gemini-exp-1114 model (currently at the top of the Chatbot Arena) and a -o json_object 1 option to force JSON output. I also released llm-claude-3…

  • Hacker News: Why LLMs Within Software Development May Be a Dead End

    Source URL: https://thenewstack.io/why-llms-within-software-development-may-be-a-dead-end/ Source: Hacker News Title: Why LLMs Within Software Development May Be a Dead End Feedly Summary: Comments AI Summary and Description: Yes Summary: The text provides a critical perspective on the limitations of current Large Language Models (LLMs) regarding their composability, explainability, and security implications for software development. It argues that LLMs…

  • Slashdot: Google, Microsoft Are Spending Massively on AI, Quarterly Earnings Show

    Source URL: https://tech.slashdot.org/story/24/11/18/0022217/google-microsoft-are-spending-massively-on-ai-quarterly-earnings-show?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Google, Microsoft Are Spending Massively on AI, Quarterly Earnings Show Feedly Summary: AI Summary and Description: Yes **Summary:** The text discusses Alphabet and Microsoft’s significant financial performance and growth driven by their investments in AI technology. Both companies have reported increased revenues, partly due to the demand for AI…

  • Hacker News: You could have designed state of the art positional encoding

    Source URL: https://fleetwood.dev/posts/you-could-have-designed-SOTA-positional-encoding Source: Hacker News Title: You could have designed state of the art positional encoding Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the evolution of positional encoding in transformer models, specifically focusing on Rotary Positional Encoding (RoPE) as utilized in modern language models like Llama 3.2. It explains…

  • Hacker News: Reverse Engineering iOS 18 Inactivity Reboot

    Source URL: https://naehrdine.blogspot.com/2024/11/reverse-engineering-ios-18-inactivity.html Source: Hacker News Title: Reverse Engineering iOS 18 Inactivity Reboot Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the new inactivity reboot feature in iOS 18, which adds a significant layer of security by forcing a device reboot after three days of inactivity. This feature aims to protect…