Tag: training data

  • The Register: Judge tosses publishers’ copyright suit against OpenAI

    Source URL: https://www.theregister.com/2024/11/08/openai_copyright_suit_dismissed/ Source: The Register Title: Judge tosses publishers’ copyright suit against OpenAI Feedly Summary: Raw Story and AlterNet allowed to amend complaint. A US judge has thrown out a case against ChatGPT developer OpenAI which alleged it unlawfully removed copyright management information (CMI) when building training sets for its chatbots.… AI Summary and…

  • Schneier on Security: AI Industry is Trying to Subvert the Definition of “Open Source AI”

    Source URL: https://www.schneier.com/blog/archives/2024/11/ai-industry-is-trying-to-subvert-the-definition-of-open-source-ai.html Source: Schneier on Security Title: AI Industry is Trying to Subvert the Definition of “Open Source AI” Feedly Summary: The Open Source Initiative has published its definition of “open source AI,” and it’s terrible. It allows for secret training data and mechanisms. It allows for development to be done…

  • Hacker News: Perceptually lossless (talking head) video compression at 22kbit/s

    Source URL: https://mlumiste.com/technical/liveportrait-compression/ Source: Hacker News Title: Perceptually lossless (talking head) video compression at 22kbit/s Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the recent advancements in the LivePortrait model for animating still images and its implications for video compression, particularly in the realm of deepfake technology. This innovation presents significant…

  • Hacker News: WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning

    Source URL: https://arxiv.org/abs/2411.02337 Source: Hacker News Title: WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning Feedly Summary: Comments AI Summary and Description: Yes Summary: The paper introduces WebRL, a novel framework that employs self-evolving online curriculum reinforcement learning to enhance the training of large language models (LLMs) as web agents. This development is…

  • Simon Willison’s Weblog: Nous Hermes 3

    Source URL: https://simonwillison.net/2024/Nov/4/nous-hermes-3/#atom-everything Source: Simon Willison’s Weblog Title: Nous Hermes 3 Feedly Summary: Nous Hermes 3 The Nous Hermes family of fine-tuned models have a solid reputation. Their most recent release came out in August, based on Meta’s Llama 3.1: Our training data aggressively encourages the model to follow the system and instruction prompts exactly…

  • Slashdot: New ‘Open Source AI Definition’ Criticized for Not Opening Training Data

    Source URL: https://news.slashdot.org/story/24/11/03/0257241/new-open-source-ai-definition-criticized-for-not-opening-training-data?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: New ‘Open Source AI Definition’ Criticized for Not Opening Training Data Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the controversy surrounding the newly released Open Source AI definition, which some believe undermines traditional open-source principles by allowing certain proprietary practices around training data. The concerns raised…

  • Hacker News: SmolLM2

    Source URL: https://simonwillison.net/2024/Nov/2/smollm2/ Source: Hacker News Title: SmolLM2 Feedly Summary: Comments AI Summary and Description: Yes Summary: The text introduces SmolLM2, a new family of compact language models from Hugging Face, designed for lightweight on-device operations. The models, which range from 135M to 1.7B parameters, were trained on 11 trillion tokens across diverse datasets, showcasing…

  • Simon Willison’s Weblog: SmolLM2

    Source URL: https://simonwillison.net/2024/Nov/2/smollm2/#atom-everything Source: Simon Willison’s Weblog Title: SmolLM2 Feedly Summary: SmolLM2 New from Loubna Ben Allal and her research team at Hugging Face: SmolLM2 is a family of compact language models available in three sizes: 135M, 360M, and 1.7B parameters. They are capable of solving a wide range of tasks while being lightweight enough…

  • Hacker News: Show HN: Cerebellum – Open-Source Browser Control with Claude 3.5 Computer Use

    Source URL: https://github.com/theredsix/cerebellum Source: Hacker News Title: Show HN: Cerebellum – Open-Source Browser Control with Claude 3.5 Computer Use Feedly Summary: Comments AI Summary and Description: Yes Summary: The text outlines the functionality of Cerebellum, a lightweight browser automation tool utilizing a large language model (LLM) to achieve user-defined goals through web interactions. Its significance…

  • Wired: This Is a Glimpse of the Future of AI Robot

    Source URL: https://www.wired.com/story/physical-intelligence-home-robot/ Source: Wired Title: This Is a Glimpse of the Future of AI Robot Feedly Summary: Physical Intelligence, a well-funded startup chasing breakthroughs in robotic intelligence, has developed a robot capable of doing various household chores remarkably well. AI Summary and Description: Yes Summary: The text discusses Physical Intelligence, a startup that has…