MBZUAI’s PAN Model Enhances AI's Ability To Simulate And Reason Over Time
The Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) has unveiled PAN, a model designed to enhance AI's ability to understand and simulate the evolving world. This development signifies progress towards AI systems capable of reasoning, predicting, and planning. PAN differs from traditional video generation tools by maintaining continuity over time, integrating visual understanding with reasoning to follow natural language instructions.
At the heart of PAN is the Generative Latent Prediction (GLP) framework. This framework distinguishes between what occurs in a scene and its appearance. Initially, the model creates an internal latent state—a structured representation that retains memory of scene elements and their movements based on prior context and user directives. It then translates this latent state into a brief video segment, ensuring visual coherence over extended periods.

PAN's benchmarking results demonstrate its superior performance among open-source systems in three critical areas: action simulation fidelity, long-horizon forecasting, and simulative reasoning and planning. These capabilities pave the way for advancements in robotics, autonomous systems, and decision-support research, where understanding action impacts is crucial.
Developed by MBZUAI’s Institute of Foundation Models (IFM), PAN embodies the institute’s commitment to creating impactful and scientifically rigorous foundation models. IFM fosters collaboration among leading experts from Abu Dhabi, Paris, and Silicon Valley to drive responsible AI innovation and address global challenges through joint efforts.
Unlike conventional video generation tools that produce short clips, PAN ensures continuity across time. It combines visual comprehension with reasoning to execute natural language instructions like "drive through a snowy forest" or "walk toward the lighthouse," generating sequences that remain consistent from one moment to the next.
The introduction of PAN marks a significant step forward in AI technology. By enabling systems to reason, predict, and plan effectively, it opens new possibilities for various applications. The model's ability to maintain logical sequences over time sets it apart from existing technologies.
PAN's development reflects MBZUAI’s mission to advance AI research with global impact. By connecting talent across different regions, the institute aims to foster innovation that addresses pressing global issues responsibly. This collaborative approach ensures that AI advancements are both scientifically sound and socially beneficial.
As AI continues to evolve, models like PAN highlight the potential for more sophisticated systems capable of understanding complex scenarios. The integration of visual understanding with reasoning represents a significant advancement in AI technology. This progress underscores the importance of ongoing research and collaboration in shaping the future of artificial intelligence.
The conclusion highlights how PAN represents a major advancement in AI technology by enabling systems to reason effectively over time. Its development aligns with MBZUAI’s mission to create impactful models through global collaboration while addressing societal challenges responsibly.
With inputs from WAM