|
|
|
|
|
by sfriedr
1069 days ago
|
|
Could you share more about copyright? For example, aren't you worried that now, with all kinds of lawsuits happening [1] and copyright issues that were found in existing datasets [2], that you might get threatening letters from a lawyer some day? I'm the author of [3] where we introduced one of the first natural-language datasets that test graduate mathematics for LLMs, but some of the prompts we took from a copyrighted book and therefore thought about excluding them. Having them in the public dataset would be really nice though, hence I'm keen about your experience. I'd also be keen to hear how your challenge against the DMCA on sharing LLaMA's weights goes? [1] https://www.theguardian.com/books/2023/jul/05/authors-file-a...
[2] https://arxiv.org/abs/2105.05241
[3] https://arxiv.org/abs/2301.13867 |
|
Personally, I'm not worried. It would be a damn shame if academics come under fire merely for trying to operate on the cutting edge of science. None of us were trying to make money; we just wanted to make something interesting.
> I'd also be keen to hear how your challenge against the DMCA on sharing LLaMA's weights goes?
Thanks! I think we might be putting up a website for it soon, if only to explain ourselves. In the meantime – I hate this phrase, since I don't want followers – the only way to keep informed is to follow my Twitter, and perhaps keep an eye on my HN comments.
You'll probably hear about it either way though, since it's a groundbreaking case. No one has tested the copyrightability of ML models before.