Re: Goal-based AI

Eliezer Yudkowsky (sentience@pobox.com)
Tue, 31 Dec 1996 14:50:37 -0600

Messages sorted by: [ date ][ thread ][ subject ][ author ]
Next message: Natasha V. More: "Happy New Year!"
Previous message: Eliezer Yudkowsky: "Re: Goal-based AI"
Maybe in reply to: Lee Daniel Crocker: "Goal-based AI"
Next in thread: Eliezer Yudkowsky: "Re: Goal-based AI"

> > Part one: If it's even theoretically possible to convince a human to
> > let it out, a Power will know *exactly* what to say to get him to do so.
>
> That's why solid reason is important. If a robot can convince you with
> valid reason that it is safe, then it /is/ safe. If it manipulates you
> emotionally to open its cage, then you deserve whatever happens.

I think we differ in our assessments of human intelligence relative to
what is possible. In my opinion, humans (including me) are easily
fooled, emotionally motivated whether we like it or not, and barely
worthy of the name "rational thought". That's why I'm a Singularity
fan. A Power that's honest and tries to inform us as fully as possible
will probably remain locked in the box forever; a malevolent Power will
lie, cheat, manipulate us both logically and emotionally, and otherwise
do whatever is necessary to get out. There won't be any *reasoning*
involved. They'll create an internal model of a human and work out a
sequence of statements that would more or less inevitably result in
getting out. Imagine a billion little simulations of yourself all being
tested to find the magic words.

The end result of your procedure is to free malevolent intelligences
while locking up the only beings capable of saving us.

-- 
         sentience@pobox.com      Eliezer S. Yudkowsky
          http://tezcat.com/~eliezer/singularity.html
           http://tezcat.com/~eliezer/algernon.html
Disclaimer:  Unless otherwise specified, I'm not telling you
everything I think I know.

Next message: Natasha V. More: "Happy New Year!"
Previous message: Eliezer Yudkowsky: "Re: Goal-based AI"
Maybe in reply to: Lee Daniel Crocker: "Goal-based AI"
Next in thread: Eliezer Yudkowsky: "Re: Goal-based AI"