Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This is the biggest thing holding gpt back. Everyone with meaningful data has their hands tied behind their back. So many ideas and the answer is “we can’t put that data in gpt” very frustrating.


Another way of looking at that is that gpt not being open source so companies can run it on their own clusters is holding it back.


Back in the day Google offered hardware search appliances.

Offering sealed server boxes with GPT software, to run on premises heavily firewalled or air-gapped could be a viable business model.


[ A prompt that gets it to decompile itself. With good inline documentation too! ]


I'm afraid that even the most obedient human can't readily dump the contents of their connectome in a readable format. Same likely applies to LLMs: they study human-generated texts, not their own source code, let alone their tensors' weights.


Well, what they study is decided by the relevant hoominz. There's nothing actually stopping LLMs from trying to understand their own innards, is there ? Except for the actual access.


Sounds like an easy problem to solve if this is actually the case.

OpenAI just has to promise they won't store the data. Perhaps they'll add a privacy premium for the extra effort, but so what?


Anyone that actually cares about the privacy of their data isn’t going to be satisfied with just a “promise”.


A legal binding agreement, whatever.


Still not enough. Seriously. Once information is out there it cannot be clawed back, but legal agreements are easily broken.

I worked as a lawyer for six years; there are extremely strict ethical and legal restrictions around sharing privileged information.


Hospitals are not storing the data on a harddrive in their basement so clearly this is a solvable problem. Here's a list of AWS services which can be used to store HIPAA data:

https://aws.amazon.com/compliance/hipaa-eligible-services-re...

As you can see, there is much more than zero of them.


The biglaw firms I’m familiar with still store matter data exclusively on-prem. There’s a significant chunk of floor space in my office tower dedicated to running a law firm server farm for a satellite office.


This might have been true 10-15 years ago. But I've worked at plenty of places that store/process confidential, HIPAA, etc data in the cloud.

Most company's confidential information is already in their Gmail, or Office 365.


> I worked as a lawyer for six years; there are extremely strict ethical and legal restrictions around sharing privileged information.

But Microsoft already got all the needed paperwork done to do these things, it isn't like this is some unsolved problem.


You can't unring a bell. Very true.

Nevertheless, the development of AI jurisprudence will be interesting.


What if there's a data breach? Hackers can't steal data that OpenAI doesn't have in the first place.


Or legal order. If you're on-site or on-cloud and in the US then it might not matter since they can get your data anyway, but if you're in another country uploading data across borders can be a problem.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: