Reversal in Thomson Reuters Case May Bode Well For Copyright Owners Against AI

Thomson

It has already caught the attention of most copyright watchers that Judge Bibas of the District Court for the District of Delaware (3rd Circuit) reversed his own 2023 summary judgment ruling in the copyright AI case Thomson Reuters v. Ross Intelligence. Thompson, which owns the legal research database Westlaw, sued Ross for copyright infringement after the latter built its competitive AI-powered search tool by copying over 2,000 headnotes from Westlaw. Headnotes contain summaries which the court finds are sufficiently original for copyright protection, and it also finds that the material is protected under the doctrine of “selection and arrangement.”

Judge Bibas found copyright infringement of the headnotes and held that Ross’s defenses, including fair use, all failed. It is the fair use ruling that may be predictive of outcomes in other cases alleging copyright infringement for the purpose of AI training. Notably, Judge Bibas held that fair use factors one and four favored Thomson, and that Thompson prevails overall on fair use. To review, my amended summaries of the fair use factors are:

  • The purpose of the use, including whether the use is commercial.
  • The nature of the work used (i.e., whether it is more factual or creative).
  • The amount of the work used, including whether the “heart” of the work was used.
  • The potential market harm to the work used, namely whether the use substitutes for a use that the copyright owner retains the exclusive right to exploit in the market.

In Thomson, it is compelling that the court finds factors one and four go to plaintiff and that these carry the fair use finding overall when factors two and three go to defendant Ross. I say this because in other AI cases involving ingestion of entire visual, musical, and literary works, factors two and three will surely go to plaintiffs, and the AI developers can only hang their hopes on factors one and four.

Under factor one, Judge Bibas held that Ross’s use was clearly commercial and that the purpose of the use serves essentially the same purpose as the works used. Here, the opinion uses language that could benefit other AI developers, but not necessarily. It states:

Ross was using Thomson Reuters’s headnotes as AI data to create a legal research tool to compete with Westlaw. It is undisputed that Ross’s AI is not generative AI (AI that writes new content itself). Rather, when a user enters a legal question, Ross spits back relevant judicial opinions that have already been written.  

On the one hand, that parenthetical note that Ross is “not generative” could be cited to argue that generative AIs like Midjourney or Udio favor a finding of transformativeness under factor one. But several of the strongest cases against the developers present similar evidence of “spitting back” copies of the material ingested. Further, as emphasized in Udio and Suno, two AIs built on ingesting protected sound recordings, plaintiffs also present a strong argument that the GAIs serve the same purpose as the works used and, therefore, the purpose is not transformative.

Where a court finds under factor one that an infringing use serves the “same purpose” as the work used, this will often, quit logically, lead to finding market substitution under factor four. Here, Judge Bibas is forthright in his reversal about his initial instinct to leave factor four as a question of fact to be decided by the jury. Most notably, in my view, he writes…

I worried whether there was a relevant, genuine issue of material fact about whether Thomson Reuters would use its data to train AI tools or sell its headnotes as training data. And I thought a jury ought to sort out “whether the public’s interest is better served by protecting a creator or a copier.”

Those first considerations from 2023 reprise two familiar arguments presented in fair use defenses, but which courts have generally found unpersuasive in recent high-profile cases. That the plaintiff is not yet in the market being pursued by the defendant has been held erroneous because it fails to properly consider the “potential” market for the protected works. Next, the “public interest” (i.e., for innovation’s sake) argument has been held too broad in major fair use cases—except Google v. Oracle, which is an outlier for several reasons. Thus, in reversing his thinking, Judge Bibas writes…

Even taking all facts in favor of Ross, it meant to compete with Westlaw by developing a market substitute. And it does not matter whether Thomson Reuters has used the data to train its own legal search tools; the effect on a potential market for AI training data is enough. Ross bears the burden of proof. It has not put forward enough facts to show that these markets do not exist and would not be affected.

Because factor two is generally considered the least important and factor four has long been considered the most important, Judge Bibas rests on that precedent to find that fair use overall favors Thomson. What this decision could signal for many AI developers who have copied millions of creative works to train their models is that the generalized “innovation and important for society” arguments will find slippery footing when they argue fair use.

Congress Should Only Pass AM Radio Act with the American Music Fairness Act

music fairness

Two bills are back in motion in the U.S. Congress—the AM Radio in Every Vehicle Act, and the American Music Fairness Act (AMFA). As I argued in a post for The Hill last May, if the first bill is to become law, then the second bill should also become law. While the AM radio provision arguably has some public-serving benefits, it is unavoidably a favor to broadcasters—and to pass it without finally delivering long overdue justice for music artists would be unconscionable.

AM Radio in Every Vehicle

The AM Radio in Every Vehicle Act, mandating that all new vehicles include receivers for the AM band, was initially drafted in response to the auto industry signaling a move away from AM radio in new cars. For instance, in electric vehicles (EV), the powertrains interfere with AM radio reception, but even manufacturers of traditional powertrain vehicles were recognizing a downward trend in AM listeners and, therefore, planning to discontinue AM receivers in new sound systems. Thus, Senator Ed Markey (D-MA) and Representative Josh Gottheimer (D-NJ) sponsored the AM Radio in Every Vehicle bills, which gained bipartisan support, albeit along partisan lines.

Republicans emphasized the importance of conservative talk radio while Democrats highlighted the role of AM in the emergency broadcast system and its reach to underserved markets. Make of those rationales what you will (e.g., most people likely get their emergency messaging on their cellphones), but there is no question that AM in Every Vehicle would be a legislative favor to broadcasters, extending the life of terrestrial radio as the market moves toward alternatives like streaming and podcasts. Even if the market eventually abandons the AM band, the bill can serve as a precedent for FM radio, where more drivers listen to music. And that is significant because U.S. broadcasters still do not pay royalties to music artists for terrestrial radio play. That’s where AMFA comes in.

American Music Fairness Act (AMFA)

While the AM Radio bill was reintroduced this week by Rep. Frank Pallone, Jr. (D-NJ), Senators Marsha Blackburn (R-Tenn.), Alex Padilla (D-Calif.), Thom Tillis (R-N.C.), and Cory Booker (D-N.J.) reintroduced AMFA. “The United States is the only democratic country in the world in which artists are not paid for the use of their music on AM and FM radio,” said Senator Blackburn. “This legislation would close an outdated loophole that has allowed corporate broadcasters to take advantage of artists and their songs for decades.”

That’s it in a nutshell. And in a time when bipartisanship is in short supply, one would be hard-pressed to find a Member of Congress who does not agree that radio broadcasters should pay royalties to music artists for their vital contributions to the stations. The broadcasters will oppose AMFA, as they have always opposed royalty legislation, on the basis that 1) they cannot afford the royalties; and/or 2) their promotional value outweighs the royalty value. These claims are overstated, but even if they were not, the AMFA bill answers both. As describe in an older post:

For smaller stations (under $1.5 million/year), the AMFA caps royalties between $10/year to $500/year depending on revenue and status as either a public or private station. For larger stations and networks, rates would be set, as they for the rest of the performance licensing market, by the Copyright Royalty Board (CRB). Under the provisions of AMFA, the CRB must consider station size and revenue when setting rates and must also consider the station’s promotional value to recording artists. It’s hard to imagine how the deal gets more fair than that.

As composers and artists have noted in many contexts, music is more regulated in terms of price than most other products on the market. While radio broadcasters are free to charge what the market will bear for advertising on their networks—and the largest entities earn billions in revenue—the music artists, with AMFA, are asking for a regulated price of more than zero for use of the only product that draws listeners to the stations in the first place. (Or do you listen to radio for the commercials?)

Further, not only should Congress condition passing AM Radio on passing AMFA, but it should set aside the political theater of a little-known resolution called the Local Radio Fairness Act (LRFA). Since at least 2008, this insincere (one might say cynical) never-to-be-passed resolution is akin to a loyalty pledge to the broadcasters, promising not to mandate royalties for musical artists. For instance, last year’s resolution highlighted the unfounded implication that radio play only benefits the artists rather than the honest assessment expressed in AMFA that the benefits are at least mutual if not generally tilted in favor of the broadcasters.

If Congress wants to mandate that AM radio remain in automobiles for the foreseeable future, so be it. But Members should also acknowledge that the law is a gift to the broadcasters and a precedent for a similar mandate for FM in the future. As such, it is only fair that Congress finally require American radio stations to pay artists for the music without which many radio stations would have little or no value whatsoever.


Photo by Ababil12

The Copyright War Was Never Just About Copyright

copyright war

The so-called “copyright war” began years before I joined the fight, arguably in 1999, when defenders of the P2P platform Napster equated music piracy with liberty. Thus, rather than a rational discussion about the interdependence of creators and technology, Big Tech cultivated a syncretic foundation from which to sell the paradox that devaluing individual rights was somehow good for democracy. After all, it was easy to promote the view that copyright only mattered to wealthy rockstars and giant corporations while eliding the subtle but significant fact that it is a constitutional right of every citizen.

The “copyright war” peaked in the public eye in 2011/12 when the tech-funded campaign to defeat the anti-piracy bills SOPA and PIPA raged across the social platforms, convincing even professional creators that free speech and the “internet as we know it” were doomed if those bills became law. Outlandish lies about the perils of that legislation—mostly crafted and promoted by left-leaning organizations, journalists, and academics—masked the cyberlibertarian philosophy of tech’s most influential figures. Because for many leading tech-bros, it was never just about copyright, but rather, their barely disguised contempt for rights in general and that outdated political model we call the Republic. And what better way to hide an anti-democratic agenda than in plain sight with populist slogans like democratization?

Timothy Snyder, in a recent article making the point that destruction of the American state is the only agenda of tech oligarchs, writes: “The logic of ‘move fast and break things,’ like the logic of all coups, is to gain quick dramatic successes that deter and demoralize and create the impression of inevitability. Nothing is inevitable.” Citing Facebook’s old motto in context to events in Washington since the start of Trump 2.0 is spot on. Yes, for many tech companies the assault on rights like copyrights was purely about siphoning wealth from creators, but for the biggest egos in the room—like the guys on stage at the inauguration—it’s about usurping power.

When I first jumped into the copyright fight in 2011, I was lectured by Mike Masnick and others that I simply didn’t understand the economic concept of creative destruction. My friends and I were accused of “clinging to buggy whips in a world of automobiles,” failing to see how new technologies opened up new opportunities for artists, even as they closed “outdated” modes of production and distribution. But in the same way those messages distorted the math—omitting evidence of destruction without creation—the “digital rights” crowd ignored or endorsed the fact that what they were really promoting was the an ideological, anti-democratic agenda.

The idea that an independent photographer or songwriter might dare to remedy widespread, unlicensed uses of their works online was not just financially anathema to platform owners’ interests, it was philosophically repugnant that the “unthinking demos,” to quote Peter Thiel, should have any say whatsoever about the geniuses building our new utopia. This is the real spirit behind the “don’t stifle innovation” talking point—promoting that alleged inevitability cited by Snyder, now being used to push unfocused development of artificial intelligence as a mandate without public oversight.

From a broad perspective, the ideological assault on copyright was a powerful framework for teaching citizens to disregard the rights and dignity of other citizens through the anonymizing medium of digital technology—a “concealing paint emancipating us into savagery,” to borrow from William Goldman. Getting permission to use a creator’s work was scorned by the same rationales that Redditors applied to sharing stolen nudes of celebrities or which tech-evangelists still use to justify all manner of toxic content under the general view that it’s all just speech.

Parallel to Big Tech’s attack on the very idea of permission and respect for individual rights, the major platforms arrogated the role of oversight—and everyone fell for the trick. It was illusory oversight, of course, but both message and perception were that Facebook, Twitter, et al. offered better transparency (i.e., more truth) than any journalist or civil servant ever could. As more news and politics populated social media, the word “sunlight” was often repeated as a talismanic code, which meant that no government agent, no journalist, no expert could be trusted because the “real truth” lies somewhere in the morass of alternate realities hosted on the web.

The assault on copyright was often described by my friends and colleagues as a “canary in a coal mine” because it was easy for most observers to compartmentalize the “war” as a minor skirmish that, at worst, might deprive already rich people of a few bucks. In truth, it was one battle in a broader war that has now manifest in lawless, incompetent, and violent individuals mucking about in the federal government—including one non-American tech oligarch getting his mitts all over our public affairs without any oversight. As shocking as these events may be, they’re not surprising. Big Tech said, “Disrupt everything.” They weren’t kidding.