Sunday, 14 September 2025
27.7 C
Singapore
28.6 C
Thailand
20.2 C
Indonesia
28.1 C
Philippines

New AI model developed for high-resolution video generation

A Chinese research team has developed an open-source AI model, Pyramid Flow, for cost-effective, high-resolution video generation at 768p.

Researchers from Peking University, Kuaishou Technology, and Beijing University of Posts and Telecommunications have made significant progress in AI video generation. Their new AI model, Pyramid Flow, promises to revolutionise the way high-resolution virtual videos are created.

Unlike many proprietary models that require expensive resources and are often difficult to access, the team behind Pyramid Flow has chosen to make their model open-source. This move allows developers and users worldwide to access the technology freely, allowing a broader audience to experiment with and use it for various purposes.

Pyramid Flow’s cost-effective approach to high-resolution video generation

Pyramid Flow takes an innovative approach by generating videos through low-resolution stages before reaching the final high-resolution output. This multi-stage process helps to significantly reduce the computing power needed to run the model, making it more affordable and practical for users. The team claims that Pyramid Flow can produce a five-second video clip at 384p resolution in just 56 seconds, demonstrating the efficiency of their model.

One of Pyramid Flow’s most notable advantages is its ability to create high-quality, detailed imagery. The model has been shown to generate lifelike visuals, including complex scenes like underwater explosions that produce bubbles and splashing water. This level of realism is an exciting breakthrough for the AI video generation community, especially given its low cost.

Open-source availability and potential concerns

Along with the model, the team has made the source code available under the MIT License. This means that anyone can download, modify, and use the software for personal and commercial purposes without worrying about licensing fees or restrictions. The team has also provided several sample videos showcasing the impressive output quality of the model.

Additionally, the research team has made the datasets used to train Pyramid Flow available to the public. These datasets consist of approximately 10 million short videos, allowing other developers to build upon and improve the model in the future.

However, using open-source datasets in AI video generation has raised some concerns. Critics argue that such practices could infringe on the intellectual property rights of copyright holders. While the team behind Pyramid Flow has yet to address these concerns directly, they have suggested that their model could be a valuable tool for fine-tuning open-source material. This would help reduce reliance on third-party sources, alleviating some copyright concerns.

Pyramid Flow represents a significant leap forward in AI video generation technology. It offers both high-quality output and an open-source approach that could open up new possibilities for developers and creators. The cost-effective nature of the model and the free access to the underlying code and datasets could reshape the way AI-generated videos are used across industries, making high-resolution video creation more accessible than ever.

Hot this week

Sony showcases connected media ecosystem at IBC 2025

Sony highlights advanced media production tools and sustainable innovation at IBC 2025, showcasing AI, cloud, and virtual production.

Employment Hero report shows workers under pressure and rethinking careers

Employment Hero’s 2025 Jobs Report reveals rising costs, job mobility, and Gen Z’s shift towards security and purpose across four countries.

Maxicare adopts Agentforce to streamline dental authorisations

Maxicare adopts Salesforce’s Agentforce to automate dental authorisations, improving clinic efficiency and member healthcare services.

My Arcade launches Atari Gamestation Go handheld with over 200 retro games

My Arcade unveils the Atari Gamestation Go handheld with over 200 retro games, launching in North America this October for US$179.99.

NVIDIA Blackwell Ultra sets new benchmark in MLPerf inference tests

NVIDIA’s Blackwell Ultra architecture sets new records in MLPerf Inference v5.1, boosting AI performance and reducing costs for enterprises.

Asus unveils US$4,000 ProArt P16 with 4K tandem OLED and RTX 5090

Asus launches its ProArt P16 laptop with a 4K tandem OLED, RTX 5090 GPU, and creator-focused features, priced from US$1,999.

Lenovo unveils Legion Go 2 handheld with OLED display and higher price tag

Lenovo launches the Legion Go 2 handheld with an OLED display, upgraded specs and a higher starting price of €999 at IFA 2025.

Samsung could launch two Galaxy Z Fold8 models in 2026

Samsung may release two Galaxy Z Fold8 models in 2026, including one with a square-like screen, alongside the Galaxy Z Flip8.

Apple brings new health features to older Watch models

Apple adds hypertension notifications and Sleep Score to older Watch models with watchOS 26, expanding health tools beyond its newest devices.

Related Articles

Popular Categories