Having AI systems try to outwit one another could help judge their intentions

May 6, 2018

Table of Contents

Take, for instance, an AI system designed to defend against human or AI hackers. To prevent the system from doing anything harmful or unethical, it may be necessary to challenge it to explain the logic for a particular action. That logic might be too complex for a person to comprehend, so the researchers suggest having another AI debate the wisdom of the action with the first system, using natural language, while the person observes.

Further details appear in a research paper.

Source: technologyreview.com

Tags :

comments powered by Disqus

What tech calls “AI” isn’t really AI

First, the problem itself is poorly defined: what do you mean by intelligence? Nature, with all her blind hideous strength, endless experimentation and wild wastes of infinite time, has only managed the trick once (by our narrow definition), with one species of tree-ape on a rolling green world. Even if you believe there’s intelligent biological life elsewhere, the stats aren’t promising.

Having AI systems try to outwit one another could help judge their intentions

Tags :

Share :

Related Posts

What tech calls “AI” isn’t really AI

The EU is trying to decide whether to grant robots personhood.

The Army Is Working on Brain Hacks to Help Soldiers Deal With Information Overload