Sunday, 19 January 2025
25.9 C
Singapore

New AI model developed for high-resolution video generation

A Chinese research team has developed an open-source AI model, Pyramid Flow, for cost-effective, high-resolution video generation at 768p.

Researchers from Peking University, Kuaishou Technology, and Beijing University of Posts and Telecommunications have made significant progress in AI video generation. Their new AI model, Pyramid Flow, promises to revolutionise the way high-resolution virtual videos are created.

Unlike many proprietary models that require expensive resources and are often difficult to access, the team behind Pyramid Flow has chosen to make their model open-source. This move allows developers and users worldwide to access the technology freely, allowing a broader audience to experiment with and use it for various purposes.

Pyramid Flow’s cost-effective approach to high-resolution video generation

Pyramid Flow takes an innovative approach by generating videos through low-resolution stages before reaching the final high-resolution output. This multi-stage process helps to significantly reduce the computing power needed to run the model, making it more affordable and practical for users. The team claims that Pyramid Flow can produce a five-second video clip at 384p resolution in just 56 seconds, demonstrating the efficiency of their model.

One of Pyramid Flow’s most notable advantages is its ability to create high-quality, detailed imagery. The model has been shown to generate lifelike visuals, including complex scenes like underwater explosions that produce bubbles and splashing water. This level of realism is an exciting breakthrough for the AI video generation community, especially given its low cost.

Open-source availability and potential concerns

Along with the model, the team has made the source code available under the MIT License. This means that anyone can download, modify, and use the software for personal and commercial purposes without worrying about licensing fees or restrictions. The team has also provided several sample videos showcasing the impressive output quality of the model.

Additionally, the research team has made the datasets used to train Pyramid Flow available to the public. These datasets consist of approximately 10 million short videos, allowing other developers to build upon and improve the model in the future.

However, using open-source datasets in AI video generation has raised some concerns. Critics argue that such practices could infringe on the intellectual property rights of copyright holders. While the team behind Pyramid Flow has yet to address these concerns directly, they have suggested that their model could be a valuable tool for fine-tuning open-source material. This would help reduce reliance on third-party sources, alleviating some copyright concerns.

Pyramid Flow represents a significant leap forward in AI video generation technology. It offers both high-quality output and an open-source approach that could open up new possibilities for developers and creators. The cost-effective nature of the model and the free access to the underlying code and datasets could reshape the way AI-generated videos are used across industries, making high-resolution video creation more accessible than ever.

Hot this week

Square Enix announces PC specs for Final Fantasy VII: Rebirth

Square Enix reveals PC specs for Final Fantasy VII: Rebirth, offering tailored settings from basic 1080p to 4K visuals with NVIDIA RTX 50 upgrades.

ChatGPT’s head of product to testify in US antitrust case against Google

ChatGPT’s head of product, Nick Turley, will testify in the US government’s antitrust case against Google, addressing AI and competition issues.

OPPO partners with Mobile Legends: Bang Bang for a smooth gaming experience on the Reno13 series

OPPO partners with Mobile Legends: Bang Bang for the Reno13 Series, unveiling the MLBB x OPPO Smooth Legend Cup with prizes worth US$10,000+.

Bioptimus secures US$41M to create groundbreaking AI for biology

French startup Bioptimus raises US$41M to develop AI that simulates biological processes, driving medical, biotech, and cosmetic innovations.

More applicants but harder to hire: LinkedIn highlights hiring challenges in 2025

LinkedIn's 2025 research highlights hiring struggles in APAC, driven by a skills mismatch, rising AI demands, and new tools to address these challenges.

ASUS introduces ProArt Display 5K PA27JCV for creative professionals

ASUS unveils the ProArt Display 5K PA27JCV, a 27-inch monitor offering 5K resolution, Delta E<2 colour accuracy, and advanced features for creators.

Character AI tests games on its platform to boost user engagement

Character AI introduces games to its platform to boost user engagement and enhance its entertainment offerings.

Canoo files for bankruptcy, ending seven years of EV innovation

Canoo, a seven-year-old EV startup, filed for bankruptcy and ceased operations after failing to secure funding.

Perplexity acquires Read.cv, a professional networking platform

Perplexity acquires professional networking platform Read.cv, ending its operations. Users can export data until May 16 as domains shift to Hello.cv.