by Bob Yirka , Tech Xplore

Playing from Image Prompts: We can prompt Genie with images generated by text-to-image models, hand-drawn sketches or real-world photos. In each case we show the prompt frame and a second frame after taking one of the latent actions four consecutive times. In each case we see clear character movement, despite some of the images being visually distinct from the dataset. Credit: arXiv (2024). DOI: 10.48550/arxiv.2402.15391

AI researchers at Google’s DeepMind, working with colleagues at the University of British Columbia, have announced the development of Genie, an AI-backed application capable of turning a single image into a playable 2D virtual world.

The team has posted a paper on the arXiv preprint server outlining their work and has also posted an announcement page on DeepMind’s research site.

Two-dimensional video games, such as Super Mario Bros., allow players to manipulate a character on a video screen as they proceed through a virtual world. In this new effort, the team at DeepMind has automated the process of creating 2D video games: Genie accepts a single image, such as a character in front of an imagined background, and uses it to generate the rest of the game. This was made possible by training it on thousands of hours of video from hundreds of 2D video games.

To create Genie, the team first built a video tokenizer that converts video frames into discrete tokens the system can use to build new frames. They then added what they describe as a “latent action model,” which learns from unlabeled video a small set of actions that account for the change from one frame to the next.

Next, they added a dynamics model that predicts what the next frame might look like, given the frames so far and the chosen latent action, based on what it learned during the training phase. The result is a series of frames linked together to form what looks like a 2D virtual world.
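The division of labor among the three components can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the one-line text "frames", the three hand-written actions, and the function names are hypothetical stand-ins for what in Genie are learned neural networks, and none of it is taken from DeepMind's code.

```python
from typing import List, Tuple

Tokens = Tuple[int, ...]

# Hypothetical toy action set; the real model learns its actions from video.
STAY, RIGHT, LEFT = 0, 1, 2


def tokenize(frame: str) -> Tokens:
    """Toy 'video tokenizer': map each cell of a text frame to a discrete token."""
    return tuple(1 if cell == "@" else 0 for cell in frame)


def detokenize(tokens: Tokens) -> str:
    """Render tokens back into a printable frame ('@' is the character)."""
    return "".join("@" if t == 1 else "." for t in tokens)


def infer_latent_action(prev: Tokens, nxt: Tokens) -> int:
    """Toy 'latent action model': label the change between two frames."""
    shift = nxt.index(1) - prev.index(1)
    if shift > 0:
        return RIGHT
    if shift < 0:
        return LEFT
    return STAY


def predict_next(tokens: Tokens, action: int) -> Tokens:
    """Toy 'dynamics model': predict the next frame's tokens from tokens + action."""
    pos = tokens.index(1)
    step = {STAY: 0, RIGHT: 1, LEFT: -1}[action]
    new_pos = max(0, min(len(tokens) - 1, pos + step))  # stay inside the frame
    return tuple(1 if i == new_pos else 0 for i in range(len(tokens)))


def play(prompt_frame: str, actions: List[int]) -> List[str]:
    """Roll out frames from a single prompt frame, one action per step."""
    tokens = tokenize(prompt_frame)
    frames = [prompt_frame]
    for action in actions:
        tokens = predict_next(tokens, action)
        frames.append(detokenize(tokens))
    return frames


if __name__ == "__main__":
    # Take the same action four consecutive times, as in the figure caption.
    for frame in play("..@.....", [RIGHT] * 4):
        print(frame)
```

Running the script prints five one-line frames in which the "@" character moves one cell to the right each step, a crude analogue of the prompt frame and follow-up frame shown in the figure.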

Credit: Google DeepMind

The researchers acknowledge that Genie is still very much a work in progress, with several limitations not easily seen in the examples provided. It runs very slowly, approximately 20 to 30 times slower than what the average player would consider normal speed. It also makes a lot of mistakes and can, for example, create unrealistic worlds that are not playable. And it is currently limited in scope: it can only run 16 frames at a time.
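One way to picture the 16-frame limit is as a sliding window over recent frames. The sketch below rests on that reading, which is an assumption and not a detail the article spells out. The function name and the use of bare frame indices in place of real frames are hypothetical.

```python
from collections import deque
from typing import Deque, List

CONTEXT_FRAMES = 16  # the limit reported in the article


def rollout_with_window(num_steps: int, window: int = CONTEXT_FRAMES) -> List[List[int]]:
    """Return, for each generation step, the frame indices still in the window."""
    # Frame 0 stands for the prompt image; maxlen makes the deque drop old frames.
    context: Deque[int] = deque([0], maxlen=window)
    visible = []
    for step in range(1, num_steps + 1):
        visible.append(list(context))  # frames a model could condition on
        context.append(step)           # newly generated frame; oldest may fall off
    return visible


if __name__ == "__main__":
    history = rollout_with_window(20)
    print(history[15])  # frames 0..15: the window is full
    print(history[16])  # frames 1..16: the prompt image has been forgotten
```

Under this reading, the original prompt image falls out of view after 16 generated frames, which is consistent with the researchers listing the frame limit among Genie's current constraints.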

Still, the team at DeepMind suggests that Genie demonstrates a new step forward in video game development, allowing users to generate their own games based on their own unique preferences.

More information:
Jake Bruce et al, Genie: Generative Interactive Environments, arXiv (2024). DOI: 10.48550/arxiv.2402.15391

Genie: Generative Interactive Environments: sites.google.com/view/genie-2024/home and deepmind.google/research/publications/60474/

Journal information:
arXiv
