Source URL: https://simonwillison.net/2024/Oct/31/cerebras-coder/#atom-everything
Source: Simon Willison’s Weblog
Title: Cerebras Coder
Feedly Summary: Cerebras Coder
Val Town founder Steve Krouse has been building demos on top of the Cerebras API that runs Llama3.1-70b at 2,000 tokens/second.
Having a capable LLM with that kind of performance turns out to be really interesting. Cerebras Coder is a demo that implements Claude Artifact-style on-demand JavaScript apps, and running at that speed means the changes you request are visible in under a second.
Steve’s implementation (created with the help of Townie, the Val Town code assistant) demonstrates the simplest possible version of an iframe sandbox.
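To make the iframe sandbox idea concrete, here is a minimal sketch in TypeScript. It is not Steve’s code: the helper names `escapeAttribute` and `sandboxedIframe` are hypothetical, and the default of granting only `allow-scripts` is my assumption rather than anything the demo is known to use. The idea is to put the model-generated HTML into the iframe's `srcdoc` attribute and let the `sandbox` attribute decide what that page is allowed to do.

```typescript
// Hypothetical helper: escape a string so it is safe inside a
// double-quoted HTML attribute. "&" is replaced first so the entities
// added by the later replacements are not escaped a second time.
function escapeAttribute(value: string): string {
  return value
    .replace(/&/g, "&amp;")
    .replace(/"/g, "&quot;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;");
}

// Hypothetical helper: wrap untrusted, generated HTML in a sandboxed iframe.
// `permissions` lists the sandbox tokens to grant. The default of
// "allow-scripts" alone is an assumption made for this sketch.
function sandboxedIframe(
  generatedHtml: string,
  permissions: string[] = ["allow-scripts"],
): string {
  const sandbox = escapeAttribute(permissions.join(" "));
  const srcdoc = escapeAttribute(generatedHtml);
  return `<iframe sandbox="${sandbox}" srcdoc="${srcdoc}"></iframe>`;
}

// Usage: the resulting string can be dropped into the host page's markup.
const markup = sandboxedIframe("<h1>Hello</h1><script>document.title = 'demo'</script>");
console.log(markup);
```

Each token you add to `sandbox` loosens the restrictions, so it is worth keeping the list short. In particular, granting both `allow-scripts` and `allow-same-origin` to content that shares the host page's origin lets that content reach into the parent page, which largely defeats the point of the sandbox.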