From PC Mag: Nvidia is now facing a potential class-action lawsuit over its NeMo Megatron AI model. Three novelists filed a lawsuit against Nvidia on Friday for alleged copyright infringement, arguing that Nvidia used their work to train its model and thereby violated their books' copyright protections.
The authors argue that Nvidia's NeMo Megatron-GPT, first released in September 2022, copies and draws from their books "without consent, without credit, and without compensation."
"During training, the LLM copies and ingests each textual work in the training dataset and extracts protected expression from it," the complaint reads.
The lawsuit states that Nvidia's NeMo Megatron large language model (LLM) was trained on EleutherAI's dataset dubbed "The Pile," which consists of 800 GB of data, including 108 GB of books. The Pile's books component is also referred to as "Books3," which is reportedly made up of more than 196,000 books taken from "Bibliotik" and includes those of the authors who filed the lawsuit.