Hacker News new | ask | show | jobs
by flatiron 1197 days ago
Any concerns about sending code directly to a third party? I’m very hesitant to use any tool that is sending my assets to the cloud to be used who knows how.
5 comments

This is one of the reasons I'm so excited about being able to run large language models on my own laptop.

I've managed to get LLaMA to generate snippets of Python code already, and with some fine tuning I imagine it could become equivalent to Copilot. https://til.simonwillison.net/llms/llama-7b-m2#user-content-...

Although that is an outdated generated snippet as it doesn't use pathlib?

What if you added a type annotation to filename as Path?

Yeah it's a pretty terrible completion - GPT4 does that MUCH better.

But GPT4 doesn't run on my laptop using ~4GB of RAM.

Yes. The privacy concerns are huge. With that said, id use the shit out of this for open source development where privacy is not a concern.
Instead you have legal concerns when you are re-licensing code you don't know where it comes from.

Someone could have published a snippet of code with retained copyright (not FOSS) and GPT-4 slurped it up, splitting it back to you and somehow the law firm representing the author comes across it.

Have there been any tried cases of this happening yet, or is everyone still waiting for the legal precedent to be set for this situation?
This sounds like plausible deniability, and not legal problems, at the hobbiest level. Not something I am going to worry about.
Not sure about they're policy but OpenAI is addressing the concern.

https://techcrunch.com/2023/03/01/addressing-criticism-opena...

The large tech company I work at explicitly told us not to upload code or any private company info to ChatGPT or similar services. I assume other companies have the same policy.
I think companies are getting very receptive to uploading code with the new 30-day policy. We have been able to negotiate this for financial firms. Reach out and we might be able to make something happen.
I imagine openai will offer self hosted versions of this at some point for enterprises.
And they delete completions within 30 days.