The Register: Perplexity AI decries News Corp’s ‘simply false’ data scraping claims

Source URL: https://www.theregister.com/2024/10/25/perplexity_news_corp_data/
Source: The Register
Title: Perplexity AI decries News Corp’s ‘simply false’ data scraping claims

Feedly Summary: ‘They prefer to live in a world where publicly reported facts are owned by corporations’
Artificial intelligence startup Perplexity AI has hit back at a lawsuit claiming that it’s unfairly harvesting data from Dow Jones & Co and the New York Post to feed its AI engine, as well as stealing and mangling content.…

AI Summary and Description: Yes

Summary: Perplexity AI faces a lawsuit from News Corp alleging unauthorized data usage and content misattribution, to which Perplexity has strongly responded, denying the allegations and emphasizing its openness to dialogue. This lawsuit highlights the ongoing tension between generative AI technologies and media companies regarding data usage, ownership, and attribution.

Detailed Description:
The ongoing legal battle between Perplexity AI and News Corp underscores significant issues surrounding intellectual property rights in the rapidly evolving landscape of AI and media. The lawsuit implicates various critical aspects relevant to professionals in the realms of AI, information security, and compliance:

– **Lawsuit Allegations**:
– News Corp alleges that Perplexity is improperly harvesting data from its publications, including the New York Post, and misrepresenting the sourced content provided by these media outlets.
– The company seeks $150,000 for each proven infringement, highlighting the potentially high stakes involved in such legal disputes.

– **Perplexity’s Response**:
– Perplexity claims to have responded to News Corp’s outreach promptly, countering assertions of ignoring the issue.
– The startup defends its operations by asserting that its AI engine does not simply reproduce text but links back to source articles where applicable.

– **Legal Trends in Generative AI**:
– Perplexity points to a troubling trend wherein media companies file lawsuits against AI entities, often making exaggerated claims. They express skepticism about the legitimacy of these lawsuits.
– The statement that many companies prefer a system where publicly available information incurs a cost indicates a significant philosophical and operational clash between traditional media and emerging AI technologies.

– **Industry Context**:
– Perplexity notes that there are approximately 36 ongoing lawsuits against generative AI tools from media companies, reflecting a broader industry concern regarding the intersection of information ownership and technological innovation.
– The startup’s willingness to engage in content-sharing agreements with other publications suggests a proactive approach to working within the evolving legal framework.

This situation holds important implications for security and compliance professionals who must navigate the complexities of data use rights, intellectual property laws, and the ethical considerations surrounding generative AI technologies. It emphasizes the necessity for robust legal frameworks that can adequately address conflicts arising from AI development and deployment within a heavily regulated and litigious context.

Key Insights:
– The case illustrates ongoing tensions between generative AI startups and traditional media businesses, revealing underlying issues related to copyright, data usage, and fair use in the age of AI.
– Legal outcomes could influence future practices regarding AI training data sourcing and the way AI models are expected to handle content attribution and linkage.
– Companies in the AI space may need to reevaluate their legal strategies and content sharing practices in light of potential litigation and industry skepticism.