Court Dismisses Authors’ Copyright Infringement Claims Against OpenAI * TorrentFreak


In recent months, rightsholders of all ilks have filed lawsuits against companies that develop AI models.

The list includes record labels, individual authors, visual artists, and even the New York Times. These rightsholders all object to the presumed use of their work without proper compensation.

Several of the lawsuits filed by book authors include a piracy component. The cases allege that tech companies, including Meta and OpenAI, used the controversial Books3 dataset to train their models.

The Books3 dataset was created in 2020 by AI researcher Shawn Presser, who scraped the library of ‘pirate’ site Bibliotik. The general vision was that the plaintext collection of more than 195,000 books, nearly 37GB in size, could help AI enthusiasts build better models.

The vision wasn’t wrong; large text archives are great training material for Large Language Models, but many authors disapprove of their works being used in this manner, without permission or compensation….

You can read the full TorrentFreak article here
