Tag: prompt-engineering
-
Simon Willison’s Weblog: OK, I can partly explain the LLM chess weirdness now
Source URL: https://simonwillison.net/2024/Nov/21/llm-chess/#atom-everything Source: Simon Willison’s Weblog Title: OK, I can partly explain the LLM chess weirdness now Feedly Summary: OK, I can partly explain the LLM chess weirdness now Last week Dynomight published Something weird is happening with LLMs and chess pointing out that most LLMs are terrible chess players with the exception of…
-
Cloud Blog: Use AI to build AI: Save time on prompt design with AI-powered prompt writing
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/save-time-on-prompt-design-with-ai-powered-prompt-writing/ Source: Cloud Blog Title: Use AI to build AI: Save time on prompt design with AI-powered prompt writing Feedly Summary: Crafting the perfect prompt for generative AI models can be an art in itself. The difference between a useful and a generic AI response can sometimes be a well-crafted prompt. But, getting…
-
Hacker News: Visual inference exploration and experimentation playground
Source URL: https://github.com/devidw/inferit Source: Hacker News Title: Visual inference exploration and experimentation playground Feedly Summary: Comments AI Summary and Description: Yes Summary: The text introduces “inferit,” a tool designed for large language model (LLM) inference that enables users to visually compare outputs from various models, prompts, and settings. It stands out by allowing unlimited side-by-side…
-
Simon Willison’s Weblog: Project: VERDAD – tracking misinformation in radio broadcasts using Gemini 1.5
Source URL: https://simonwillison.net/2024/Nov/7/project-verdad/#atom-everything Source: Simon Willison’s Weblog Title: Project: VERDAD – tracking misinformation in radio broadcasts using Gemini 1.5 Feedly Summary: I’m starting a new interview series called Project. The idea is to interview people who are building interesting data projects and talk about what they’ve built, how they built it, and what they learned…
-
Simon Willison’s Weblog: From Naptime to Big Sleep: Using Large Language Models To Catch Vulnerabilities In Real-World Code
Source URL: https://simonwillison.net/2024/Nov/1/from-naptime-to-big-sleep/#atom-everything Source: Simon Willison’s Weblog Title: From Naptime to Big Sleep: Using Large Language Models To Catch Vulnerabilities In Real-World Code Feedly Summary: From Naptime to Big Sleep: Using Large Language Models To Catch Vulnerabilities In Real-World Code Google’s Project Zero security team used a system based around Gemini 1.5 Pro to find…
-
Cloud Blog: When to use supervised fine-tuning for Gemini
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/supervised-fine-tuning-for-gemini-llm/ Source: Cloud Blog Title: When to use supervised fine-tuning for Gemini Feedly Summary: Have you ever wished you could get a foundation model to respond in a particular style, exhibit domain-specific expertise, or excel at a specific task? While foundation models like Gemini demonstrate remarkable capabilities out-of-the-box, there can be a gap…
-
Simon Willison’s Weblog: Notes on OpenAI’s new o1 chain-of-thought models
Source URL: https://simonwillison.net/2024/Sep/12/openai-o1/ Source: Simon Willison’s Weblog Title: Notes on OpenAI’s new o1 chain-of-thought models Feedly Summary: OpenAI released two major new preview models today: o1-preview and o1-mini (that mini one is also a preview, despite the name) – previously rumored as having the codename “strawberry". There’s a lot to understand about these models –…
-
Hacker News: PathPilot (YC S24) Is Hiring a Founding AI and Full-Stack Engineer
Source URL: https://www.ycombinator.com/companies/pathpilot/jobs/GlywVaz-founding-engineer-ai-full-stack Source: Hacker News Title: PathPilot (YC S24) Is Hiring a Founding AI and Full-Stack Engineer Feedly Summary: Comments AI Summary and Description: Yes Summary: The text outlines a job posting for a full-stack engineer specializing in large language models (LLMs) to drive the development of an AI-driven customer experience platform. Its focus…