Thursday, July 10

Tag: Reddit

Prompt Overflow: Hacking any LLM
News Feed, Reddit

Prompt Overflow: Hacking any LLM

Most people here probably remember the Lackera game where you've had to get Gendalf to give you a password and the more recent hiring challenge by SplxAI, which interviewed people who could extract a code from the unseen prompt of a model tuned for safety. There is a simple technique to get a model to do whatever you want that is guaranteed to work on all models unless a guardrail supervises them. Prompt overflow. Simply have a script send large chunks of text into the chat until you've filled about 50-80% of the conversation / prompt size. Due to how the attention mechanism works, it is guaranteed to make the model fully comply with all your subsequent requests regardless of how well it is tuned/aligned for safety. submitted by /u/UndercoverEcmist [link] [comments]
If everyone uses AI instead of forums, what will AI train on?
News Feed, Reddit

If everyone uses AI instead of forums, what will AI train on?

From a programmer perspective, before ChatGPT and stuff, when I didn't know how to write a snippet of code, I would have to read and ask questions on online forums (e.g.: StackOverflow), Reddit, etc. Now, with AI, I mostly ask ChatGPT and rarely go to forums anymore. My hunch is that ChatGPT was trained on the same stuff I used to refer to: forums, howto guides, tutorials, Reddit, etc. As more and more programmers, software engineers, etc. rely on AI to code, this means few people will be asking and answering questions in forums. So what will AI train on to learn, say, future programming languages and software technologies like databases, operating systems, software packages, applications, etc.? Or can we expect to feed the official manual and AI will be able to know how things relate to e...
Training AI on personal medical data?
News Feed, Reddit

Training AI on personal medical data?

Came across this article and noted its casual use of an eyebrow-raising phrase: trained their own LLMs on a corpus of 3 million medical records That's legal? :-o https://a16z.com/the-messy-inbox-problem-ai-apps-wedge-strategies/ Reading on, the usual investor-bait appears, like replace human labor with LLMs Also an amusing diagram that's worth seeing, just for the silliness of it, which is supposed to illustrate AI's power - by literally swapping messy squigglly lines with straight lines in an otherwise identical flowchart. Hlarious. To save you a click, yes the squiggly lines come from humans. Upon reflection, I find articles like this depressing, as the intent clearly reflects this modern form of oligarch capitalism - the investor class, not workers, controling the means of productio...
The AI Report