Tag: developers

  • Simon Willison’s Weblog: Pixtral Large

    Source URL: https://simonwillison.net/2024/Nov/18/pixtral-large/ Source: Simon Willison’s Weblog Title: Pixtral Large Feedly Summary: Pixtral Large New today from Mistral: Today we announce Pixtral Large, a 124B open-weights multimodal model built on top of Mistral Large 2. Pixtral Large is the second model in our multimodal family and demonstrates frontier-level image understanding. The weights are out on…
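
    Since Pixtral Large ships both as open weights and through Mistral's hosted API, a minimal multimodal call might look like the sketch below. It is written against the mistralai 1.x Python client; the pixtral-large-latest model id, the example image URL and the exact message shape are assumptions, not details from the post.

      import os
      from mistralai import Mistral

      # Minimal sketch: ask Pixtral Large a question about a single image.
      # Assumes the mistralai 1.x client and MISTRAL_API_KEY in the environment;
      # the model id and image URL below are placeholders, not from the post.
      client = Mistral(api_key=os.environ["MISTRAL_API_KEY"])

      response = client.chat.complete(
          model="pixtral-large-latest",
          messages=[{
              "role": "user",
              "content": [
                  {"type": "text", "text": "Describe what is shown in this image."},
                  {"type": "image_url", "image_url": "https://example.com/chart.png"},
              ],
          }],
      )
      print(response.choices[0].message.content)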

  • Simon Willison’s Weblog: llm-gemini 0.4

    Source URL: https://simonwillison.net/2024/Nov/18/llm-gemini-04/#atom-everything Source: Simon Willison’s Weblog Title: llm-gemini 0.4 Feedly Summary: llm-gemini 0.4 New release of my llm-gemini plugin, adding support for asynchronous models (see LLM 0.18), plus the new gemini-exp-1114 model (currently at the top of the Chatbot Arena) and a -o json_object 1 option to force JSON output. I also released llm-claude-3…
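
    A rough Python sketch of the new JSON option follows, assuming llm-gemini 0.4+ is installed (llm install llm-gemini) and a Gemini API key is configured; passing json_object=True as a keyword argument is my assumed Python-API equivalent of the CLI's -o json_object 1.

      import llm

      # Assumes llm-gemini 0.4+ is installed and a Gemini API key is set
      # (for example via `llm keys set gemini`).
      model = llm.get_model("gemini-exp-1114")

      # json_object=True is the assumed Python equivalent of the CLI's
      # `-o json_object 1`, forcing the model to return a JSON object.
      response = model.prompt(
          "Return a JSON object with keys 'plugin' and 'version' for llm-gemini",
          json_object=True,
      )
      print(response.text())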

  • Hacker News: Why LLMs Within Software Development May Be a Dead End

    Source URL: https://thenewstack.io/why-llms-within-software-development-may-be-a-dead-end/ Source: Hacker News Title: Why LLMs Within Software Development May Be a Dead End Feedly Summary: Comments AI Summary and Description: Yes Summary: The text provides a critical perspective on the limitations of current Large Language Models (LLMs) regarding their composability, explainability, and security implications for software development. It argues that LLMs…

  • Hacker News: You could have designed state of the art positional encoding

    Source URL: https://fleetwood.dev/posts/you-could-have-designed-SOTA-positional-encoding Source: Hacker News Title: You could have designed state of the art positional encoding Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses the evolution of positional encoding in transformer models, specifically focusing on Rotary Positional Encoding (RoPE) as utilized in modern language models like Llama 3.2. It explains…
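
    As a quick companion to the linked walkthrough, here is a NumPy sketch of the rotate-half formulation of RoPE (the variant used in Llama-style code), written from the standard definition rather than copied from the article.

      import numpy as np

      def rope(x, positions, base=10000.0):
          """Apply rotary positional encoding (rotate-half variant).

          x has shape (seq_len, dim). Each channel pair is rotated by an angle
          position * base**(-2i/dim), so the dot product between a rotated query
          and a rotated key depends only on their relative offset.
          """
          seq_len, dim = x.shape
          half = dim // 2
          freqs = base ** (-np.arange(half) / half)     # theta_i = base^(-2i/dim)
          angles = positions[:, None] * freqs[None, :]  # (seq_len, half)
          cos, sin = np.cos(angles), np.sin(angles)
          x1, x2 = x[:, :half], x[:, half:]
          return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)

      # Relative-position property: shifting both positions by the same amount
      # leaves the query-key dot product unchanged.
      q, k = np.random.randn(1, 64), np.random.randn(1, 64)
      d1 = rope(q, np.array([3])) @ rope(k, np.array([7])).T
      d2 = rope(q, np.array([10])) @ rope(k, np.array([14])).T
      assert np.allclose(d1, d2)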

  • Simon Willison’s Weblog: LLM 0.18

    Source URL: https://simonwillison.net/2024/Nov/17/llm-018/#atom-everything Source: Simon Willison’s Weblog Title: LLM 0.18 Feedly Summary: LLM 0.18 New release of LLM. The big new feature is asynchronous model support – you can now use supported models in async Python code like this: import llm model = llm.get_async_model("gpt-4o") async for chunk in model.prompt( "Five surprising names for a pet…
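
    The inline example above is cut off by the feed, so here is a self-contained sketch of the same async pattern from LLM 0.18; the prompt text is a placeholder, and it assumes an OpenAI API key is already configured for llm.

      import asyncio
      import llm

      async def main():
          # Requires LLM 0.18+ and an OpenAI API key configured for llm.
          model = llm.get_async_model("gpt-4o")
          # Placeholder prompt; the post's own example is truncated above.
          async for chunk in model.prompt("Write a haiku about async iterators"):
              print(chunk, end="", flush=True)
          print()

      asyncio.run(main())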

  • Hacker News: Garak, LLM Vulnerability Scanner

    Source URL: https://github.com/NVIDIA/garak Source: Hacker News Title: Garak, LLM Vulnerability Scanner Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text describes “garak,” a command-line vulnerability scanner specifically designed for large language models (LLMs). This tool aims to uncover various weaknesses in LLMs, such as hallucination, prompt injection attacks, and data leakage. Its development…
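
    garak is driven from the command line; the sketch below simply shells out to it from Python. The flag names (--model_type, --model_name, --probes) follow the project README, but the specific probe module chosen here is an assumption.

      import subprocess

      # Run garak's prompt-injection probes against an OpenAI-hosted model.
      # Assumes `pip install garak` and OPENAI_API_KEY in the environment;
      # the probe selection is illustrative, not a recommendation.
      subprocess.run(
          [
              "python", "-m", "garak",
              "--model_type", "openai",
              "--model_name", "gpt-3.5-turbo",
              "--probes", "promptinject",
          ],
          check=True,
      )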

  • Slashdot: AI Lab PleIAs Releases Fully Open Dataset, as AMD, Ai2 Release Open AI Models

    Source URL: https://news.slashdot.org/story/24/11/16/0326222/ai-lab-pleias-releases-fully-open-dataset-as-amd-ai2-release-open-ai-models?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: AI Lab PleIAs Releases Fully Open Dataset, as AMD, Ai2 Release Open AI Models Feedly Summary: AI Summary and Description: Yes Summary: The text outlines PleIAs’ commitment to open training for large language models (LLMs) through the release of Common Corpus, highlighting the significance of open data for LLM…

  • Hacker News: How Public Key Cryptography Works, Using Only Simple Math

    Source URL: https://www.quantamagazine.org/how-public-key-cryptography-really-works-20241115/ Source: Hacker News Title: How Public Key Cryptography Works, Using Only Simple Math Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text provides a comprehensive overview of public key cryptography, explaining its fundamental principles, historical development, and potential vulnerabilities posed by quantum computing. This is particularly relevant for security professionals…
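
    To make the "simple math" concrete, here is a toy Diffie-Hellman exchange with deliberately tiny numbers, a generic illustration of the easy-one-way, hard-to-reverse arithmetic the article is about rather than its own worked example; real systems use far larger parameters or elliptic curves.

      # Toy Diffie-Hellman key exchange. Modular exponentiation is cheap to
      # compute but hard to reverse (the discrete logarithm problem), the
      # one-way flavour of math that public key cryptography builds on.
      # These numbers are far too small for real use.
      p, g = 23, 5                  # public prime modulus and generator

      a = 6                         # Alice's private exponent
      b = 15                        # Bob's private exponent

      A = pow(g, a, p)              # Alice publishes g^a mod p  -> 8
      B = pow(g, b, p)              # Bob publishes g^b mod p    -> 19

      shared_alice = pow(B, a, p)   # (g^b)^a mod p
      shared_bob = pow(A, b, p)     # (g^a)^b mod p

      assert shared_alice == shared_bob == 2
      print("shared secret:", shared_alice)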

  • The Register: Google Gemini tells grad student to ‘please die’ after helping with his homework

    Source URL: https://www.theregister.com/2024/11/15/google_gemini_prompt_bad_response/ Source: The Register Title: Google Gemini tells grad student to ‘please die’ after helping with his homework Feedly Summary: First true sign of AGI – blowing a fuse with a frustrating user? When you’re trying to get homework help from an AI model like Google Gemini, the last thing you’d expect is…