async - Cloud Security Alliance News Clipping Site

Simon Willison’s Weblog: Weeknotes: asynchronous LLMs, synchronous embeddings, and I kind of started a podcast

Nov 22, 2024

—

by

Source URL: https://simonwillison.net/2024/Nov/22/weeknotes/#atom-everything Source: Simon Willison’s Weblog Title: Weeknotes: asynchronous LLMs, synchronous embeddings, and I kind of started a podcast Feedly Summary: These past few weeks I’ve been bringing Datasette and LLM together and distracting myself with a new sort-of-podcast crossed with a live streaming experiment. Project: interviewing people about their projects Datasette Public Office…

Cloud Blog: Don’t let resource exhaustion leave your users hanging: A guide to handling 429 errors

Nov 21, 2024

—

by

system automation

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/learn-how-to-handle-429-resource-exhaustion-errors-in-your-llms/ Source: Cloud Blog Title: Don’t let resource exhaustion leave your users hanging: A guide to handling 429 errors Feedly Summary: Large language models (LLMs) give developers immense power and scalability, but managing resource consumption is key to delivering a smooth user experience. LLMs demand significant computational resources, which means it’s essential to…

Simon Willison’s Weblog: llm-gemini 0.4

Nov 18, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Nov/18/llm-gemini-04/#atom-everything Source: Simon Willison’s Weblog Title: llm-gemini 0.4 Feedly Summary: llm-gemini 0.4 New release of my llm-gemini plugin, adding support for asynchronous models (see LLM 0.18), plus the new gemini-exp-1114 model (currently at the top of the Chatbot Arena) and a -o json_object 1 option to force JSON output. I also released llm-claude-3…

Simon Willison’s Weblog: LLM 0.18

Nov 17, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Nov/17/llm-018/#atom-everything Source: Simon Willison’s Weblog Title: LLM 0.18 Feedly Summary: LLM 0.18 New release of LLM. The big new feature is asynchronous model support – you can now use supported models in async Python code like this: import llm model = llm.get_async_model(“gpt-4o") async for chunk in model.prompt( "Five surprising names for a pet…

Tag: async

Simon Willison’s Weblog: Weeknotes: asynchronous LLMs, synchronous embeddings, and I kind of started a podcast

Cloud Blog: Don’t let resource exhaustion leave your users hanging: A guide to handling 429 errors

Simon Willison’s Weblog: llm-gemini 0.4

Simon Willison’s Weblog: LLM 0.18