bytedance announces goku and takes on Google's Luma and Openai's Sora

Bytedance, the parent company of Tiktok, has dropped a family of a collaborative image and video generation model called Goku. The model appears to have been named after the popular anime character Goku from the Dragon Ball series.

This comes right after the company mocked a video AI model that generates videos from images called Omnihuman-1.

Researchers argue that the Goku model can help create product videos featuring AI-generated influencers, marketing avatars, landscape demonstrations, Chinese poetry visualizations, portrait video demos and more .

This research paper attributes the ability of the model to generate high-quality videos into several important factors. One is the implementation of Modification Flow (RF) formulations for joint image and video generation, and the compression of employment of 3D joint image video VAEs into a shared latent space.

Additionally, the architecture features a fully-focused transnetwork, enhanced with techniques such as flashat, sequence parallelism, patch N-pack, 3D rope position embedding, and QK normalization.

This paper also demonstrates that the Goku model demonstrates superior performance in both qualitative and quantitative evaluations, setting new benchmarks compared to competitors such as Luma, Open-Sora, Mira, Pika, etc. It’s there.

Goku achieved 0.76 in Geneval, 83.65 in DPG bench for image generation from text, and 84.85 in VBench for intertext tasks. You can see the benchmark results below.

“We believe this work will provide valuable insights and practical advancements to the research community in developing collaborative image and video generation models,” the researchers said.

The ability of a model to generate high-quality product videos with AI-generated influencers and other realistic visuals can have significant benefits for content creators, influencers, marketers, and more.

Source link

What's Hot

Who is Graham Platner, the oyster farmer running for Maine Senate? | US News

Lessons to learn how to make your code vibrate using AI like ChatGPT

Masala Bond: DBS faces IT-related prosecution over 2019 Masala Bond investment

bytedance announces goku and takes on Google’s Luma and Openai’s Sora

Lessons to learn how to make your code vibrate using AI like ChatGPT

Can Apple’s iPhone be used as a passport? What travelers need to know and how to do it |

SoftBank acquires ABB’s robotics business for $5.4 billion • The Register

20 Most Anticipated Sex Movies of 2025

President Trump’s SEC nominee Paul Atkins marries multi-billion dollar roof fortune

How to tell the difference between fake and genuine Adidas Sambas

Alice Munro’s Passive Voice | New Yorker

AI systems learn from many types of scientific information and run experiments to discover new materials | MIT News

Among the most troublesome relationships in healthcare AI

Does access to AI become a fundamental human right? Sam Altman says, “Everyone would want…”

Google’s Gemini AI is on TV

Our Picks

Who is Graham Platner, the oyster farmer running for Maine Senate? | US News

Lessons to learn how to make your code vibrate using AI like ChatGPT

Masala Bond: DBS faces IT-related prosecution over 2019 Masala Bond investment

Most Popular

10 things you should never say to an AI chatbot

Character.AI faces lawsuit over child safety concerns

Analyst warns Salesforce investors about AI agent optimism

Subscribe to Updates

What's Hot

bytedance announces goku and takes on Google’s Luma and Openai’s Sora

Related Posts