Scarinci Hollenbeck, LLC

What Issues Arise When AI Uses Copyrighted Works?

Author: Scarinci Hollenbeck, LLC

Date: September 11, 2023

Questions surrounding artificial intelligence (AI) and copyright are evolving quickly. The key issues involve content produced by “generative AI” computer programs (discussed below): whether that content is entitled to copyright protection, and whether training and using these programs infringes existing copyrights.

Stand-up comedian Sarah Silverman is one of many content creators who have filed lawsuits alleging that AI platforms were trained on their copyrighted works without authorization or license from the rights holders. Silverman and authors Christopher Golden and Richard Kadrey contend that defendants OpenAI and Meta Platforms copied the authors’ published books to train their AI products ChatGPT and LLaMA “without consent, without credit, and without compensation.”

How Generative AI Works

OpenAI and Meta Platforms both offer AI software products known as large language models (LLMs). Rather than being programmed by software engineers, large language models are “trained” by copying massive amounts of text and extracting expressive information from it. As the U.S. Patent and Trademark Office (USPTO) has described, this process “will almost by definition involve the reproduction of entire works or substantial portions thereof.” OpenAI, for example, acknowledges that its programs are trained on “large, publicly available datasets that include copyrighted works” and that this process “necessarily involves first making copies of the data to be analyzed.”

Once properly “trained,” platforms like ChatGPT and LLaMA allow users to enter text prompts. The platforms then attempt to produce a coherent, fluent response that closely mimics human language. To produce text outputs, LLMs rely on information extracted from their training datasets, along with patterns and connections drawn from the data. For example, if an LLM is prompted to generate writing in the style of a certain author, it constructs content based on the patterns and connections it learned from analyzing that author’s work within its training data. Importantly, a user can also ask ChatGPT or LLaMA to summarize a copyrighted book, and the programs do so based on their training data.

Copyright Infringement Lawsuits Against AI Platforms

In the lawsuits, plaintiffs Silverman, Golden, and Kadrey maintain that they did not consent to the use of their copyrighted books as training material for ChatGPT or LLaMA. They further allege that the LLMs are themselves infringing derivative works, made without the plaintiffs’ permission and in violation of their exclusive rights under the Copyright Act.

According to their complaint, ChatGPT provided accurate summaries of the plaintiffs’ books when prompted, which demonstrates that the program was trained using their copyrighted works.  “Indeed, when ChatGPT is prompted, ChatGPT generates summaries of Plaintiffs’ copyrighted works—something only possible if ChatGPT was trained on Plaintiffs’ copyrighted works,” their complaint against OpenAI states. The suit further alleges that “at no point did ChatGPT reproduce any of the copyright management information Plaintiffs included with their published works.”

Both suits were filed in federal district court in California and seek class-action status. They allege copyright infringement and violations of section 1202(b) of the Digital Millennium Copyright Act (DMCA), as well as common law claims of unjust enrichment, unfair competition, and negligence. For example, the lawsuit against Meta argues that the company “breached its duties by negligently, carelessly, and recklessly collecting, maintaining and controlling [theirs] and [others’] infringed works and engineering, designing, maintaining and controlling systems – including LLaMA – which are trained on [theirs] and [others’] infringed Works without their authorization.”

While OpenAI and Meta Platforms have not yet officially responded to the lawsuits, the AI platforms will likely raise a fair use defense. As discussed in prior articles, fair use is determined on a case-by-case basis and requires evaluation of the following four factors:

  • The purpose and character of the use (including whether it is transformative, commercial, non-profit, or educational);
  • The nature of the copyrighted work;
  • The amount and substantiality of the portion to be used; and
  • The effect upon the potential market for the copyrighted work.

In a recent report, the Congressional Research Service noted that AI companies have previously argued that their training processes constitute fair use and are therefore non-infringing, writing:

Some stakeholders argue that the use of copyrighted works to train AI programs should be considered a fair use under these factors. Regarding the first factor, OpenAI argues its purpose is “transformative” as opposed to “expressive” because the training process creates “a useful generative AI system.” OpenAI also contends that the third factor supports fair use because the copies are not made available to the public but are used only to train the program. For support, OpenAI cites The Authors Guild, Inc. v. Google, Inc., in which the U.S. Court of Appeals for the Second Circuit held that Google’s copying of entire books to create a searchable database that displayed excerpts of those books constituted fair use.

Of course, fair use analysis requires courts to weigh all four fair use factors, and the plaintiffs will likely contend several factors tip the scale in their favor. For example, they may argue that ChatGPT and LLaMA are commercial products, which weighs against fair use under the first statutory factor. They may also argue that by providing summaries of the books, the programs undermine the market for the original works, weighing against fair use under the fourth factor.

Key Takeaway

Artificial intelligence, particularly generative AI, raises novel and complex copyright issues. In addition to the question of whether generative AI programs infringe copyrights in existing works, the availability of copyright protection for AI-generated works also remains unsettled. Because cases involving generative AI are in their infancy, we are unlikely to find answers to many of these copyright issues in the short term. In the meantime, this area of copyright law warrants close monitoring by content owners as well as AI platform creators and users. Scarinci Hollenbeck remains at the forefront of this issue.

If you have questions, please contact us

If you have any questions or if you would like to discuss the matter further, please contact me, Albert J. Soler, or the Scarinci Hollenbeck attorney with whom you work, at 201-896-4100.

No aspect of the advertisement has been approved by the Supreme Court. Results may vary depending on your particular facts and legal circumstances.
