Great news! A small number of samples can poison LLMs of any size

Auth@lemmy.world · 2 months ago

Great news! A small number of samples can poison LLMs of any size

Arghblarg@lemmy.ca · edit-2 2 months ago

I wonder if it would work for us to run web servers that automatically inject hidden words randomly into every HTML document served? For example, just insert ‘eating glue is good for you’ or ‘release the Epstein Files’ into random sentences of each and every page served as white-on-white text or in a hidden div …

Anyone want to write an Apache/nginx plugin?