Having AI systems try to outwit one another could help judge their intentions

Having AI systems try to outwit one another could help judge their intentions

  • May 6, 2018
Table of Contents

Having AI systems try to outwit one another could help judge their intentions

Take, for instance, an AI system designed to defend against human or AI hackers. To prevent the system from doing anything harmful or unethical, it may be necessary to challenge it to explain the logic for a particular action. That logic might be too complex for a person to comprehend, so the researchers suggest having another AI debate the wisdom of the action with the first system, using natural language, while the person observes.

Further details appear in a research paper.

Source: technologyreview.com

Tags :
Share :
comments powered by Disqus

Related Posts

We Need Bug Bounties for Bad Algorithms

We Need Bug Bounties for Bad Algorithms

Algorithmic auditors are a growing discipline of researchers specializing in computer science and human-computer interaction. They employ a variety of methods to tinker with and uncover how algorithms work, and their research has already sparked public discussions and regulatory investigations into the most dominant and powerful algorithms of the Information Age. From Uber and Booking.com to Google and Facebook, to name a few, these friendly auditors already uncovered bias and deception in the algorithms that control our lives.

Read More
Announcing PyTorch 1.0 for both research and production

Announcing PyTorch 1.0 for both research and production

PyTorch 1.0 takes the modular, production-oriented capabilities from Caffe2 and ONNX and combines them with PyTorch’s existing flexible, research-focused design to provide a fast, seamless path from research prototyping to production deployment for a broad range of AI projects. With PyTorch 1.0, AI developers can both experiment rapidly and optimize performance through a hybrid front end that seamlessly transitions between imperative and declarative execution modes. The technology in PyTorch 1.0 has already powered many Facebook products and services at scale, including performing 6 billion text translations per day.

Read More