Are Creators Aligned on Artificial Intelligence?

creators

One of many challenges with adoption of generative AI (GAI) tools is whether creators are willing to demonstrate a degree of solidarity on the matter—i.e., apply the principle we generally call fair trade. If Creator A uses a GAI that might be harmful to Creator B in a different field, and so on, will most creators take this broader perspective in a group effort to demand ethical uses of GAI?  Moreover, this question becomes intertwined with copyright because the use of GAI is a subject of evolving legal doctrine, meaning that creators who want to produce commercial content outside their core talents should be aware that the material produced may not be protectable under the law.

Two simple examples would be the self-published book author who might use an AI voice app to produce an audiobook, and the documentary filmmaker who might use an AI music generator to produce a soundtrack for a film. In both examples, creators in other fields—voice actors and composers respectively—are potentially harmed by the development and use of these AI tools, but 1) will the author and filmmaker take that consideration into account?; and 2) will the sound recordings in either case be protected by copyright?

In the case of the author using AI in lieu of hiring a narrator to produce the audiobook, I predict that under current doctrine, the sound recording would not be protected by copyright law because there is no human performance captured in that recording. Thus, remedies for any piracy of the audiobook would rely solely on the protection of the underlying literary work, which is effective—but if the sound recording is also protected and registered, that would be two works infringed instead of one.

This increases the potential damages for infringement, which puts the author/owner in a stronger position if she needs to take legal action. By this example, authors’ interests may be seen as aligned with those of professional book narrators. Hiring a narrator will not only achieve better quality in the reading, but capturing the human performance is also a basis for copyright attaching to the sound recording.

Similar considerations would apply to the filmmaker with the GAI soundtrack, although there may be other factors that provide the AI music with some protection we don’t find with the AI audiobook. One factor that may become relevant is whether the filmmaker can show that he exerted sufficient creative control over the final sounds. If so, he may be able to defend a claim of copyright in the soundtrack, but we are likely several years and a few lawsuits away from clear guidance on this question.

Another consideration with the soundtrack may be the Copyright Office’s current view that material using assistive AI “within a larger work” is protected. Creators should be careful about interpreting that broad language because constituent works that stand alone—and this would apply to a soundtrack for a film—would logically not be independently protected.

Of course, there are many GAI products that allow one type of creator to avoid hiring another type of creator for a given project. Some of this is inevitable, and it is not necessarily unethical or bad for creative culture. That said, even with ethically trained and ethically used AI tools, the copyright considerations should be weighed by the individual creator (i.e., do they care about protecting what might not be protectable?), but also collectively by all creators contributing to a new ecosystem.

Since 1978 in the U.S., the default is automatic copyright protection, even if most rights are never enforced. But as GAI is used to produce a lot of material that is not protected, it is hard to predict what effect this might have on copyright overall. Even older than automatic copyright with the 1976 Act, the human authorship principle fosters a new tension for creators who may wish to combine GAI and human-authored work. As a response to that tension, it would be a mistake in my view to overwrite the “human spark” doctrine and simply protect any material that “walks and talks” like a creative work. This isn’t just an emotional appeal to anthropocentrism but rather a conviction that copyright would become meaningless—even unconstitutional—by eroding the incentive rationale for its existence.

Regardless of the theoretical questions addressed in this post, I believe that as a practical matter, creators should think carefully about how and when to use GAI for various projects. As an ethical consideration, perhaps if you’re opposed to “scraping” in your industry, then opposing it in others is the right view to take. But as a business consideration, if what you’re making is meant to have commercial value, AI-generated might mean not protected by copyright—and that means even if you spend money and time on it, it isn’t yours.

Guarantee of Confusion: When AI Scrapes the News

news

That title riffs on the term of art in trademark law known as “likelihood of confusion.” It refers to a foundational test, which asks whether the average consumer will confuse a particular mark (words, design, or both) with a product or service that is not produced or distributed by the company associated with a known mark. Thus, beware the Rollex, Tilynol, or even the KleanEx. But when a real trademark is used to promote a defective product, confusion is certain—especially when the brand is a news producer.

In a lawsuit filed today by several major news publishers against an AI developer (Advance Local Media et al. v. Cohere Inc.), we see a good example of copyright and trademark combining to serve the public interest in contrast to the extensive harm that can be done by technology developers running roughshod over IP rights. Copyright incentivizes the investment in professional journalism needed to report reliable news, and trademark identifies the source of the news we choose to trust. I know readers will be inclined these days to criticize one news organization or another, but hold that thought.

The complaint filed in the District Court for the Southern District of New York names as plaintiffs several well-known news publishers (e.g., Condé Nast, Los Angeles Times, The Guardian) who allege that AI developer Cohere is liable for both copyright and trademark infringement. Valued at $5.5 billion, “Cohere’s primary product is its suite of LLMs referred to as the Command Family of models…these LLMs are trained on vast amounts of text and as a result can generate text-based, natural language responses to user queries,” the complaint states.

The Copyright Allegations

On copyright infringement, the publishers intend to show that Cohere violates their exclusive rights both when it inputs protected works to train the Command products and when it outputs verbatim or substantially similar works that are reproduced, distributed, and displayed to paying customers. The two counts of alleged trademark infringement stem from use of the publishers’ registered names in conjunction with erroneous material that may be “hallucinated” by the LLM. Clearly, anyone can recognize why this would be harmful to the reputation of the named source and broadly harmful to consumers who already struggle to validate information in this miasma we call the internet.

Notably, the Publishers stress the fact that Cohere markets itself on the reliability and timeliness of the information Command provides—benefits that would be essential for its many commercial customers, but which the company allegedly chose to accomplish through unlicensed use of the works produced by news organizations. “Cohere relies heavily on trusted journalism sources to shore up the authority of its responses. As Cohere’s CEO Aidan Gomez explained in a letter to employees and shareholders, Cohere believes that a ‘key differentiator’ for its models is the ability to receive ‘verifiable answers,” the complaint states.

Further, to support the veracity of query results, Cohere relies on “retrieval augmented generation” (RAG), which an NVIDIA blog post describes thus: “Like a good judge, large language models (LLMs) can respond to a wide variety of human queries. But to deliver authoritative answers — grounded in specific court proceedings or similar ones — the model needs to be provided that information.” This case law analogy is ironic in context because even at this very early stage, the copyright case law strongly suggests to this observer that Cohere should not have chosen the unlicensed path to build its products.

For example, a description from the complaint reminds me that the news summary product TV Eyes was held to be infringing on less compelling evidence than the following:  “The user can expand [the] Under the Hood [tool] to view the exact underlying documents on which Cohere relied to generate the response. Cohere refers to these sources as ‘snippets,’ but to be clear—these ‘snippets’ are generally the full text of every source on which the output was based.”

In fact, the allegations in this complaint imply so much familiar ground that it is hard to imagine how Cohere will raise a persuasive defense. For instance, just this week, I summarized the Delaware District Court finding that comparatively limited copying of Westlaw’s headnotes for an AI search product was considered a market substitute for the protected works. What Cohere is allegedly doing with news articles is similar in purpose but entails far more extensive, unlicensed use of substantially more protected expression than in Thomson Reuters v. Ross.

The Trademark Allegations

With the RAG tool switched on, Command will apparently provide reliable news by copying, distributing, and displaying unlicensed copies of Publishers’ works. But with RAG switched off, its LLM might hallucinate and then attribute the resulting misinformation to one of the named plaintiffs. For instance, the complaint cites a Cohere “article” that confuses the 2023 massacre at the Nova Music Festival with a 2020 shooting in Nova Scotia; reports that a man murdered at the latter “returns to the scene” of the former; and then attributes this whole mess to The Guardian.

The Publishers allege that Cohere violates two counts of the Lanham Act—trademark infringement and false designation of origin—both of which seem highly plausible based on the facts presented. We shall see whether Cohere can present compelling facts to rebut the allegations, but otherwise, as to the questions of law in this case, I predict this one easily goes to the plaintiffs.

As mentioned above, I know some readers may scoff at the premise that quality journalism is consistently the hallmark of well-established news publishers today. And to be sure, one must occasionally check the math in various articles and editorials. But I maintain that Big Tech, through its predatory model of monetizing everything it does not create—plus our willingness to believe utter nonsense online—exerts a pressure on professional journalism that borders on an existential threat. Left unchecked, the AI shenanigans like those described in this lawsuit do more than violate IP law; they undermine the efforts of any reporter who is still trying to present reality.


Photo by AndreyPopov

Music Artists Ask Congress to Finally Correct the Radio Royalties Mistake

music

Today is World Radio Day, and when most of us think of radio, we think of music. That’s why today, Congress received a letter signed by about 300 performing artists asking lawmakers to pass the long-overdue American Music Fairness Act (AMFA) this session. “Each year, AM/FM radio stations play nearly a billion songs. And each year, giant radio corporations rake in billions in advertising dollars while refusing to pay a single cent to the artists behind the music that attracts their advertisers in the first place and makes their entire business model possible,” the letter states.

The artists reiterate what presidents and legislators have known for generations—that it is absurd that the nation which produces some of the most popular recorded music in the world is almost the only democracy that does not pay artist royalties for AM/FM radio play. “This leaves the U.S. in the dubious company of North Korea, Cuba, and Iran,” the letter reminds Capitol Hill. And in a time of deep political divisions, lawmakers agree that this ongoing conflict between broadcasters and musical artists is unfounded and unfair.

As the letter reminds Congress, this is an international trade matter as much as it is a domestic issue of economics and fairness. Although most nations where American music is played on the radio do pay royalties, American artists receive nothing abroad solely because foreign artists receive nothing in the U.S. Plus, the core principle of AMFA is popular with voters. As the letter states, “…by a 6-1 margin Americans support Congress addressing this injustice….” That many Americans don’t agree on much these days, but Congress can feel confident that their constituents do support equity and fairness for American music artists.

AMFA’s Pedigree is Long – Too Long

Under copyright law, musical works, which are created by songwriters, are protected separately from sound recordings, which are created by performing artists who may or may not also be the songwriters. Broadcast, terrestrial radio networks have long paid royalties to songwriters for public performance of musical works but never to performing artists for airing the sound recordings.

AMFA, first introduced in June of 2021 by former Congressman Ted Deutch and co-sponsor Darrell Issa, is the legislative descendant of many bills over many decades proposing to remedy this inequity. During that time, artists have even earned royalties from digital transmission thanks to an amendment to the Copyright Act passed in 1995—sixteen years before Spotify became popular. So, to say the terrestrial radio fix is overdue is an understatement. I have advocated AMFA and explained its provisions in several posts, but I will copy and paste the basic terms again here:

For smaller stations (under $1.5 million/year), the AMFA caps royalties between $10/year to $500/year depending on revenue and status as either a public or private station. For larger stations and networks, rates would be set, as they for the rest of the performance licensing market, by the Copyright Royalty Board (CRB). Under the provisions of AMFA, the CRB must consider station size and revenue when setting rates and must also consider the station’s promotional value to recording artists.

The broadcasters have no plausible, factual argument to oppose the terms of AMFA; the basic principle of the bill is popular with Americans who love the music made by the signatories of this letter; and lawmakers already know all the reasons why passing this bill is the right thing to do. Time to get it done.


Photo source by Kjekol