Tag: injection
-
Simon Willison’s Weblog: Quoting Question for Department for Science, Innovation and Technology
Source URL: https://simonwillison.net/2024/Nov/1/prompt-injection/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Question for Department for Science, Innovation and Technology Feedly Summary: Lord Clement-Jones: To ask His Majesty’s Government what assessment they have made of the cybersecurity risks posed by prompt injection attacks to the processing by generative artificial intelligence of material provided from outside government, and whether…
-
The Register: Google reportedly developing an AI agent that can control your browser
Source URL: https://www.theregister.com/2024/10/28/google_ai_web_agent/ Source: The Register Title: Google reportedly developing an AI agent that can control your browser Feedly Summary: Project Jarvis will apparently conduct research, purchase products, and even book a flight on your behalf. Google is reportedly looking to sidestep the complexity of AI-driven automation by letting its multimodal large language models (LLMs)…
-
The Register: Anthropic’s latest Claude model can interact with computers – what could go wrong?
Source URL: https://www.theregister.com/2024/10/24/anthropic_claude_model_can_use_computers/ Source: The Register Title: Anthropic’s latest Claude model can interact with computers – what could go wrong? Feedly Summary: For starters, it could launch a prompt injection attack on itself… The latest version of AI startup Anthropic’s Claude 3.5 Sonnet model can use computers – and the developer makes it sound like…
-
Simon Willison’s Weblog: Quoting Model Card Addendum: Claude 3.5 Haiku and Upgraded Sonnet
Source URL: https://simonwillison.net/2024/Oct/23/model-card/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Model Card Addendum: Claude 3.5 Haiku and Upgraded Sonnet Feedly Summary: We enhanced the ability of the upgraded Claude 3.5 Sonnet and Claude 3.5 Haiku to recognize and resist prompt injection attempts. Prompt injection is an attack where a malicious user feeds instructions to a model…
-
METR Blog – METR: An update on our general capability evaluations
Source URL: https://metr.org/blog/2024-08-06-update-on-evaluations/ Source: METR Blog – METR Title: An update on our general capability evaluations Feedly Summary: The provided text discusses the development of evaluation metrics for AI capabilities, particularly focusing on autonomous systems. It aims to create measures that can assess general autonomy rather than solely relying…