• Breve
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    2
    ·
    edit-2
    3 hours ago

    My point is that you can’t talk about usage rights of a dataset without talking about a specific use case. The suggested use case was to provide a static test dataset for systems developed to use the firehose API, but the dataset could be used for literally anything from making funny memes (fair use) to training a LLM model (arguably not fair use). Does the existence of an illegal use case automatically mean the dataset itself should be illegal though?

    As a collorary, a photocopier can be used to create unauthorized reproductions of copyrighted works. Should making and disturbing photocopiers be illegal because they are capable of and used in the process of violating copyright law, or should we accept the photocopier absent of a use case isn’t breaking any laws and go after the people who use them to illegally create unauthorized reproductions?