Google DeepMind’s new generative model makes Super Mario–like games from scratch (2024)

OpenAI’s recent reveal of its stunning generative model Sora pushed the envelope of what’s possible with text-to-video. Now Google DeepMind brings us text-to-video games.

The new model, called Genie, can take a short description, a hand-drawn sketch, or a photo and turn it into a playable video game in the style of classic 2D platformers like Super Mario Bros. But don’t expect anything fast-paced. The games run at one frame per second, versus the typical 30 to 60 frames per second of most modern games.

“It’s cool work,” says Matthew Guzdial, an AI researcher at the University of Alberta, who developed a similar game generator a few years ago.

Genie was trained on 30,000 hours of video of hundreds of 2D platform games taken from the internet. Others have taken that approach before, says Guzdial. His own game generator learned from videos to create abstract platformers. Nvidia used video data to train a model called GameGAN, which could produce clones of games like Pac-Man.

Nvidia trained GameGAN with input actions (such as button presses on a controller), as well as video footage: a video frame showing Mario jumping was paired with the Jump action, and so on. Tagging video footage with input actions takes a lot of work, which has limited the amount of training data available.

In contrast, Genie and Guzdial's model were both trained on video footage alone. Guzdial's model learned level layouts and game rules, represented in code. In Genie's case, the generative model learned a visual representation, which allows it to turn starter images into game levels. This approach turns countless hours of existing online video into potential training data.

Google DeepMind’s new generative model makes Super Mario–like games from scratch (1)

GOOGLE DEEPMIND

Genie learned which of eight possible actions would cause the game character in a video to change its position. It generates each new frame of the game on the fly depending on the action the player takes. Press Jump, and Genie updates the current image to show the game character jumping; press Left and the image changes to show the character moved to the left. The game ticks along action by action, each new frame generated from scratch as the player plays.

Future versions of Genie could run faster. “There is no fundamental limitation that prevents us from reaching 30 frames per second,” says Tim Rocktäschel, a research scientist at Google DeepMind who leads the team behind the work. “Genie uses many of the same technologies as contemporary large language models, where there has been significant progress in improving inference speed.”

Genie learned some common visual quirks found in platformers. Many games of this type use parallax, where the foreground moves sideways faster than the background. Genie often adds this effect to the games it generates.

While Genie is an in-house research project and won’t be released, Guzdial notes that the Google DeepMind team says it could one day be turned into a game-making tool—something he’s working on too. “I’m definitely interested to see what they build,” he says.

Virtual playgrounds

But the Google DeepMind researchers are interested in more than just game generation. The team behind Genie works on open-ended learning, where AI-controlled bots are dropped into a virtual environment and left to solve various tasks by trial and error (a technique known as reinforcement learning).

In 2021, a different DeepMind team developed a virtual playground called XLand, in which bots learned how to cooperate on simple tasks such as moving obstacles. Sandboxes like XLand will be crucial for training future bots on a range of different challenges before pitting them against real-world scenarios. The video-game examples prove that Genie could be used to generate such virtual playgrounds.

Others have developed similar world-building tools. For example, David Ha at Google Brain and Jürgen Schmidhuber at the AI lab IDSIA in Switzerland developed a tool in 2018 that trained bots in game-based virtual environments called world models. But again, unlike Genie, these required the training data to include input actions.

The team demonstrated how this ability is useful in robotics, too. When Genie was shown videos of real robot arms manipulating a variety of household objects, the model learned what actions that arm could do and how to control it. Future robots could learn new tasks by watching video tutorials.

“It is hard to predict what use cases will be enabled,” says Rocktäschel. “We hope projects like Genie will eventually provide people with new tools to express their creativity.”

Correction: This article has been updated to clarify that Genie and XLand were developed by different teams and to clarify the similarities between Genie and Guzdial's existing work.

Google DeepMind’s new generative model makes Super Mario–like games from scratch (2024)

FAQs

Google DeepMind’s new generative model makes Super Mario–like games from scratch? ›

Now Google DeepMind brings us text-to-video games. The new model, called Genie, can take a short description, a hand-drawn sketch, or a photo and turn it into a playable video game in the style of classic 2D platformers like Super Mario Bros. But don't expect anything fast-paced.

What does Google DeepMind do? ›

Google DeepMind is an Alphabet subsidiary focusing on artificial intelligence, machine learning, and neuroscience research. Since its beginnings in 2010, the company has developed AI systems including identifying eye diseases faster and playing complex games like chess, Go, and Shogi.

How much did DeepMind cost to buy? ›

Jaan Tallinn was an early investor and an adviser to the company. On 26 January 2014, Google confirmed its acquisition of DeepMind for a price reportedly ranging between $400 million and $650 million. and that it had agreed to take over DeepMind Technologies.

Who owns Google DeepMind? ›

What is the Google DeepMind controversy? ›

In 2015, Google's AI firm DeepMind was given the personal records of 1.6 million patients at the Royal Free London NHS Foundation Trust. The law firm handling the case said it was launched to address public concerns about the use of private health data by tech firms.

Is DeepMind owned by Elon Musk? ›

Does Elon Musk own DeepMind? No, Elon Musk is not one of the owners of DeepMind. While Musk has been involved in the AI industry through ventures like OpenAI and now xAI, he is not associated with DeepMind's ownership.

How do I play Google secret games? ›

Most hidden Google games can be found simply by Googling them. Type "Snake" into the search bar and hit enter and you will be taken to a page of search results with the game Snake perched at the top. Hit "Play" and the gaming commences.

Can AI recreate a game? ›

Google DeepMind recently introduced Genie, an artificial intelligence (AI) model that can create interactive video games from just a prompt or image. This development allows new possibilities for game design and interaction.

Which Google AI makes games? ›

The company's AI research lab, Deepmind, recently announced an AI model that learned to craft 2D video games by analyzing internet videos. Once trained, the only assets a human need provide is a single image. Even a napkin drawing will do.

Who owns Google now? ›

Google is an American search engine company, founded in 1998 by Sergey Brin and Larry Page. Since 2015, Google has been a subsidiary of the holding company Alphabet, Inc.

Who is the competitor of DeepMind? ›

Top Competitors and Alternatives of DeepMind

The top three of DeepMind's competitors in the Artificial Intelligence category are Optimole with 65.61%, OpenAI with 13.88%, ARKit with 3.55% market share.

What is Google's AI called? ›

How Google's AI model Gemini got its name.

How does Google DeepMind make money? ›

How does DeepMind make money? In essence, DeepMind has a very distinctive revenue model: prior to being acquired by Google in 2014, it made all of its money by selling the technologies it developed to businesses and companies. Currently, DeepMind generates revenue by using its technology in other Alphabet initiatives.

What is the average salary for Google DeepMind? ›

The average Google DeepMind hourly pay ranges from approximately £27 per hour (estimate) for a Laborer to £50 per hour (estimate) for a Research Lab Manager. Google DeepMind employees rate the overall compensation and benefits package 4.6/5 stars.

What are the goals of DeepMind? ›

DeepMind aims to research and build safe artificial intelligence system to solve intelligence, to advance science and humanity.

What is the difference between Google Brain and DeepMind? ›

The organizational structure and areas of expertise of DeepMind and Google Brain are distinct. Thanks to its top-tier reinforcement learning team, DeepMind consistently ranks among the best in the business. But Google Brain is structured more like well-known corporate research centers like Xerox PARC or Bell centers.

References

Top Articles
Latest Posts
Article information

Author: Msgr. Benton Quitzon

Last Updated:

Views: 6129

Rating: 4.2 / 5 (43 voted)

Reviews: 82% of readers found this page helpful

Author information

Name: Msgr. Benton Quitzon

Birthday: 2001-08-13

Address: 96487 Kris Cliff, Teresiafurt, WI 95201

Phone: +9418513585781

Job: Senior Designer

Hobby: Calligraphy, Rowing, Vacation, Geocaching, Web surfing, Electronics, Electronics

Introduction: My name is Msgr. Benton Quitzon, I am a comfortable, charming, thankful, happy, adventurous, handsome, precious person who loves writing and wants to share my knowledge and understanding with you.