Engadget has been testing and reviewing consumer tech since 2004. Our stories may include affiliate links; if you buy something through a link, we may earn a commission. Read more about how we evaluate products.

DeepMind's latest AI can master games without being told their rules

MuZero represents a likely breakthrough in general-purpose AI.

Igor Bonifacic

·Contributing Reporter

23 December 2020 at 11:00 am·4-min read

In 2016, Alphabet's DeepMind came out with AlphaGo, an AI which consistently beat the best human Go players. One year later, the subsidiary went on to refine its work, creating AlphaGo Zero. Where its predecessor learned to play Go by observing amateur and professional matches, AlphaGo Zero mastered the ancient game by simply playing against itself. DeepMind then created AlphaZero, which could play Go, chess and shogi with a single algorithm. What tied all those AIs together is that they knew the rules of the games they had to master going into their training. DeepMind's latest AI, MuZero, didn't need to be told the rules of go, chess, shogi and a suite of Atari games to master them. Instead, it learned them all on its own and is just as capable or better at them than any of DeepMind's previous algorithms.

Creating an algorithm that can adapt to a situation where it doesn't know all the rules governing a simulation, but it can still find a way to plan for success has been a challenge AI researchers have been trying to solve for a while. DeepMind has consistently attempted to tackle the problem using an approach called lookahead search. With this method, an algorithm will consider future states to plan a course of action. The best way to wrap your head around this is to think about how you would play a strategy game like chess or Starcraft II. Before making a move, you'll consider how your opponent will react and try to plan accordingly. In much the same way, an AI that utilizes the lookahead method will try to plan several moves in advance. Even with a game as relatively straightforward as chess, it's impossible to consider every possible future state, so instead an AI will prioritize the ones that are most likely to win the match.

The problem with this approach is that most real-world situations, and even some games, don't have a simple set of rules governing how they operate. So some researchers have tried to get around the problem by using an approach that attempts to model how a particular game or scenario environment will affect an outcome and then use that knowledge to make a plan. The drawback of this system is that some domains are so complex that modeling every aspect is nearly impossible. This has proven to be the case with most Atari games, for instance.

In a way, MuZero combines the best of both worlds. Rather than modeling everything, it only attempts to consider those factors that are important to making a decision. As DeepMind points out, this is something you do as a human being. When most people look out the window and see dark clouds forming on the horizon, they generally don't get caught up thinking about things like condensation and pressure fronts. They instead think about how they should dress to stay dry if they go outside. MuZero does something similar.

It takes into account three factors when it has to make a decision. It will consider the outcome of its previous decision, the current position it finds itself in and the best course of action to take next. That seemingly simple approach makes MuZero the most effective algorithm DeepMind made to date. In its testing, it found MuZero was as good as AlphaZero at chess, Go and shogi, and better than all its previous algorithms, including Agent57, at Atari games. It also found that the more time it gave MuZero to consider an action, the better it performed. DeepMind also conducted testing in which it put a limit on the number of simulations MuZero could complete in advance of committing to a move in Ms Pac-Man. In those tests, it found MuZero was still able to achieve good results.

Putting up high scores in Atari games is all well and good, but what about the practical applications of DeepMind's latest research? In a word, they could be groundbreaking. While we're not there yet, MuZero is the closest researchers have come to developing a general-purpose algorithm. The subsidiary says MuZero learning capabilities could one day help it tackle complex problems in fields like robotics where there aren’t straightforward rules.

Yahoo News UK
Julia Fox criticised over 'disgusting' vagina bikini outfit by FGM campaigners
Julia Fox has been accused of wearing the trauma of female genital mutilation survivors in her provocative outfit that shows a sewn-up vagina.
a day ago
HuffPost
Ex-Aide Sums Up Donald Trump’s Attitude To Melania Trump With 3 Words
Stephanie Grisham also recalled a telling telephone call the former president made about his wife.
a day ago
NCA NewsWire
‘Dripping blood’: More knife terror hits shops
Another shocking blast of knife violence has hit Australia, this time a teen brawl featuring a machete at a popular suburban shopping centre.
14 hours ago
HuffPost
Viewers Think Laura Ingraham Just Made A Big Admission Of Guilt For Trump
The Fox News host got called out for her characterization of Trump's relationship with Stormy Daniels.
9 hours ago
NCA NewsWire
Pair found dead at home identified
The names of a man and woman found dead inside a suburban home have been revealed, as homicide squad detectives continue their investigation.
a day ago
The Daily Beast
Trump Is Already Losing in Court—and the Judge Isn’t Playing
Photo Illustration by Thomas Levinson/The Daily Beast/GettyThe jurors haven’t even been selected yet. The trial really hasn’t even started. But as Donald Trump’s first day in criminal court wrapped up on Monday, the former president already seemed to be losing.New York Supreme Court Justice Juan Merchan displayed little patience for, as he insisted on calling him, “Mr. Trump.” That the judge is already over the former president’s antics is abundantly clear. And as Trump’s lawyers repeatedly trie
19 hours ago
Cosmo
Zendaya's itsy-bitsy, retro halter dress is a *serve*
The actress Zendaya wore a retro, white mini halter neck dress to a photo call in Milan for the Challengers press tour - and she looks amazing
2 days ago
Yahoo Sport Australia
Travis Head makes history in IPL as cricket world erupts over record-breaking scenes
Travis Head and Pat Cummins both shined for Sunrisers Hyderabad. Read more here.
23 hours ago
The Independent
Controversial TikTok personality Kyle Marisa Roth dead at 36
Tik Tokker - known for her Hollywood ‘blind items’ and hot takes - has died at age 36
20 hours ago
NCA NewsWire
Higgins’ fiance breaks his silence on rape
Brittany Higgins’ partner David Sharaz has broken his silence after the Bruce Lehrmann judgment, taking to social media following the Federal Court’s decision.
2 days ago
Yahoo Sport Australia
Jason Day hit with ban at Masters after golf officials take exception to Aussie's outfit
Jason Day left viewers baffled after the bold move during the Masters. Find out more here.
2 days ago
The Independent
‘My hoo haa is gonna be out’: Nike’s US Olympic outfits need ‘constant pube vigilance’ say frustrated athletes
The brand said it is offering tailors to athletes and additional clothing options
a day ago
Australian Associated Press
Liar, rapist: Ten 'vindicated' in Lehrmann verdict
Bruce Lehrmann was so hell-bent on having sex with Brittany Higgins in Parliament House that he was indifferent to her consent and raped her, a judge has found.
2 days ago
The Daily Beast
Gloves Off! DA Wants Trump Punished for Contempt on Day 1
Photo by Jabin Botsford-Pool/Getty ImagesThe Manhattan District Attorney’s Office wants the judge overseeing the first trial against a former American president to start it off with a lunging attack, asking the court to personally sanction Donald Trump for his verbal onslaught against witnesses in the case.Shortly after noon, Assistant District Attorney Christopher Conroy formally asked the judge to fine Trump “$1,000 for each post that violates the court order,” “direct the defendant to take do
a day ago
Yahoo News Australia
Prime Minister offers Bondi's 'bollard man' Australian citizenship after 'extraordinary bravery'
Prime Minister Anthony Albanese has told hero Damien Guerot he can 'stay for as long as he likes'.
16 hours ago
Yahoo News Australia
Mum unable to use her car for hours after 'frustrating' parking act
With an unknown car parked centimetres behind hers, she was unable to access the boot to get what she needed for work.
2 days ago
Yahoo Sport Australia
Latrell Mitchell picks Souths over State of Origin as NRL star misses NSW meeting
Latrell Mitchell, Damien Cook and Cody Walker are focusing on helping the Rabbitohs. Read more here.
2 days ago
Variety
Hannah Waddingham Tells Photographer ‘Don’t Be a D—’ After Red Carpet Comment: ‘You Would Never Say That to a Man’
Hannah Waddingham had some choice words for a photographer in response to a comment made about her on the red carpet at Sunday night’s Olivier Awards. As captured on video posted by X user @odeiotedlasso, Waddingham was posing for photos on the red carpet when she stopped to address a photographer. Though what the photographer …
a day ago
HuffPost
Critics Trash Trump For Most ‘Deranged’ Claim Yet Ahead Of Criminal Trial
The former president was called out on social media for a brazen new boast.
2 days ago
Yahoo Finance AU
Apprentice tradie reveals 'ridiculous' $150,000 a year salary and explains why it's so high
He's only in his second year of his apprenticeship and he's already pulling in six figures.
2 days ago

ALL ORDS

AUD/USD

ASX 200

OIL

GOLD

Bitcoin AUD

CMC Crypto 200

DeepMind's latest AI can master games without being told their rules

MuZero represents a likely breakthrough in general-purpose AI.

Latest stories

Julia Fox criticised over 'disgusting' vagina bikini outfit by FGM campaigners

Ex-Aide Sums Up Donald Trump’s Attitude To Melania Trump With 3 Words

‘Dripping blood’: More knife terror hits shops

Viewers Think Laura Ingraham Just Made A Big Admission Of Guilt For Trump

Pair found dead at home identified

Trump Is Already Losing in Court—and the Judge Isn’t Playing

Zendaya's itsy-bitsy, retro halter dress is a serve

Travis Head makes history in IPL as cricket world erupts over record-breaking scenes

Controversial TikTok personality Kyle Marisa Roth dead at 36

Higgins’ fiance breaks his silence on rape

Jason Day hit with ban at Masters after golf officials take exception to Aussie's outfit

‘My hoo haa is gonna be out’: Nike’s US Olympic outfits need ‘constant pube vigilance’ say frustrated athletes

Liar, rapist: Ten 'vindicated' in Lehrmann verdict

Gloves Off! DA Wants Trump Punished for Contempt on Day 1

Prime Minister offers Bondi's 'bollard man' Australian citizenship after 'extraordinary bravery'

Mum unable to use her car for hours after 'frustrating' parking act

Latrell Mitchell picks Souths over State of Origin as NRL star misses NSW meeting

Hannah Waddingham Tells Photographer ‘Don’t Be a D—’ After Red Carpet Comment: ‘You Would Never Say That to a Man’

Critics Trash Trump For Most ‘Deranged’ Claim Yet Ahead Of Criminal Trial

Apprentice tradie reveals 'ridiculous' $150,000 a year salary and explains why it's so high