Discussion about this post

User's avatar
Dan Davies's avatar

This was also in "The Coming Race" by Bulwer-Lytton, in which the alien race used five year olds to pilot their war machines, because they had no conscience or empathy and could be more destructive

Tex Pasley's avatar

Anthropic appears to mostly get mocked for things like the vending machine experiment, but I actually quite like this approach, both for testing the boundaries of AI systems themselves and also for thinking about useful LLM applications. Scenario planning seems like a good one. It will take some tinkering with the human-computer interface, but it seems like the psychosis inducing properties of an LLM are actually helpful here. That property makes it possible to (hopefully) safely trick everyone (human & machine) into believing they're in an alternative state where things are really FUBAR.

3 more comments...

No posts

Ready for more?