Can AI-generated Content be Copyrighted?

This question was posed in an article today in the WSJ: AI Generated Art for a Conic Book.  Human Artists are Having a Fit.  Reading the article there seems to be two angles.  The first is this: to be copyrighted the material created by the AI model needs to “show a modicum of creativity”.  And the powers that be will have to determine if AI can create something new or novel.  On the face of it, the art in question looks new or novel.

Creativity is as Creativity Does

But toward the early part of the article another angle is noted that has an impact on the main question.  This more interesting angle is not explored further in the article and refers to the data used to train the model.  The source data-set that was used to train the AI engine may have used copyrighted material. And that data might have been used without the copyright holders permission.  In other words, the training of the machine learning engine used novel data.  Just recently we read a lot about how ML models can synthesize a lot of data.

This becomes delectable.  Could the copyright decision on the AI-created art be novel if it hadn’t used novel data to train it?  What if the training data-set included standard, non-novel data?  Does that make a difference?  Could AI models create anything deemed creative if they cannot use unique data sets?

Value From the Sausage, or the Machine?

Bestseller No. 1
Rockland Melbourne Hardside Expandable Spinner Wheel Luggage, Black, 3-Piece Set (20/24/28)
  • Three-piece set of hard-side suitcases with...
  • Includes 20 inch, 24 inch , 28 inch Upright
  • Sturdy ergonomic aluminum telescoping handle
  • Due to differences in monitors/screens - Actual...
SaleBestseller No. 2
Samsonite Omni PC Hardside Expandable Luggage with Spinner Wheels, Checked-Medium 24-Inch, Teal
  • 24" SPINNER LUGGAGE maximizes your packing power...
  • PACKING Dimensions: 24” x 17.5” x 11.5”,...
  • 10 YEAR LIMITED WARRANTY: Samsonite products are...
  • MICRO-DIAMOND POLYCARBONATE texture is extremely...
  • SIDE-MOUNTED TSA LOCKS act to deter theft,...

Last update on 2024-04-05 / Affiliate links / Images from Amazon Product Advertising API

Are there not restrictions in place to limit the use of copyright data to train AI-based models?  I’d have thought so, but I am not a lawyer.  If this is not standard, there could be a market here.  If you license some data today, such as weather data, there are restrictions on its use.  Many organizations share data today.  Does this mean any organization that shares data should prohibits use of that data in AI-based software tools, without additional payment to the copyright owner?

If I buy a copyrighted manuscript there are limitations to usage.  I cannot reprint it in any form without permission.  What about data?  Since data has unique properties compared to physical assets, maybe we should all have standard clauses to limit use in derivative-based software tools such as AI. What about AI-generated code, with a model trained on copyrighted code scripts?

New
KAVU Delray Beach Crossbody Bag Lightweight Mesh Beach Pack - Cool Aqua
  • Compact and Sleek Design: The KAVU Delray Beach...
  • Adjustable Shoulder Strap: The bag features an...
  • Two Way Zip Closure: The main compartment of the...
  • Multiple Pockets: In addition to the main...
  • Durable Material: The bag is made from a...
New
Samsonite 68308-2209 Omni Hardside Luggage 20 inch Spinner Army Green Bundle with Deco Gear 10 Piece Luggage Accessory Kit
  • Samsonite Omni Hardside Luggage 20" Spinner Army...
  • Lighter. Stronger. Bolder. Brighter
  • Effortless Slotted Spinner Wheel Design | TSA Lock
  • Large Organization Pockets | Large expansion...
  • BUNDLE INCLUDES: Deco Gear Luggage Accessory Kit...

Last update on 2024-04-05 / Affiliate links / Images from Amazon Product Advertising API

It will be interesting to see what happens in this case.  It will be even more interesting to see how that decision impacts standard data sharing practices.  If the AI-generated comic is:

  • Successfully defended as copyright, the fees and controls for usage of copyrighted data inputted to AI-based models might be expected to go up significantly.
  • Determined not to be copyrightable, owners of data used for training of such creations will resist inclusion of their data in that training, as their IP will be less monetizable.  As such, additional controls and regulation will come to the fore.

Can AI-generated Content be Copyrighted?