| IANAL. For the tables there's very little transformation, and a huge chunk of verbatim text. I don't see how there is any gain versus just publishing the course numbers and titles. For figure 4 this might fall under "unpublished material" protections, which are: https://www2.archivists.org/publications/brochures/copyright... > Generally, material is considered unpublished if it was not intended for public distribution or if only a few copies were created and distribution was limited. > The law distinguishes between published and unpublished material and the courts often afford more copyright protection to unpublished material when an asserted fair use is challenged. > Rather, courts evaluate fair use cases based on four factors, no one of which is determinative in and of itself: 2) > Courts give more protection to works that are “closer to the core of copyright protection,” such as unpublished 4) > The effect of the use upon the potential market for, or value of, the copyrighted work: This factor assesses how, and to what extent, the use damages the existing and potential market for the original. Publication of the (possibly) previously unpublished copyrighted work in figure 4 fully and completely destroys its value. I don't know if a fair use claim can overcome such an impact, though that is up to a court to determine. |
IANAL either–but how is the copyright owner (MIT presumably) harmed by the reproduction of these course descriptions? It isn't like they harm the commercial value of the courses in any way; the course is the actual product here, the description is just sales and marketing collateral, and has minimal value apart from the product it is selling.
Furthermore, given the fact the paper was coauthored by MIT employees – arXiv could argue that MIT (through its employees acting as its agents) had granted them an implied license to reproduce it. Which is the other issue – even if this isn't fair use, MIT may have agreed to license it through its agents. You can still be bound by the actions of your employees, even if those actions violated your own internal policies–especially in dealings with third parties who had no reason to suspect there was any such violation.
> I don't see how there is any gain versus just publishing the course numbers and titles.
"Algebra I" and "Algebra II" don't mean much – what topics do they actually cover? A one sentence/paragraph course description adds a lot, because they tell you what topics are actually covered. Yes, someone could probably look it up on the MIT website – but it saves the reader a lot of effort doing that. Especially if someone is reading this 20 years from now, by which time the content of MIT courses may have changed a lot (despite having the same title), and finding what their content was 20 years ago may require a lot of research effort (if the reader even thinks to do that).
> Publication of the (possibly) previously unpublished copyrighted work in figure 4 fully and completely destroys its value
Figure 4 is likely not the "work", rather a small quote from a much larger work. How does a small quote from a work (even if allegedly unpublished) "fully and completely destroys its value"?