国产高清av首播原创麻豆_麻豆黄色网_成人AV毛片无码免费网站_久色精品_国产色精品_国产成人无码aa片免费看

position: EnglishChannel  > AI ripples> Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Source: Science and Technology Daily | 2024-12-17 15:44:35 | Author: Gong Qian

On October 21, the Beijing Academy of Artificial Intelligence (BAAI), a Chinese non-profit organization engaged in AI R&D, released Emu3, a multimodal AI model that seamlessly integrates text, image, and video modalities into a single, unified framework.

The BAAI research team said Emu3 is expected to be used in scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference.

Emu3, based solely on next-token prediction, proves that next-token prediction can be a powerful paradigm for multimodal models.

The existing multimodal AI models are mostly designed for specific tasks. Each has its corresponding architecture and methods. For instance, in the field of video generation, many developers use the diffusion in time (DiT) architecture, as referenced by Sora. Other models such as Stable Diffusion are used for text-to-image synthesis, Sora for text-to-video conversion, and GPT-4V for image-to-text generation.

In contrast to these models, which have a combination of isolated skills rather than an inherently unified ability, Emu3, eliminates the need for diffusion or compositional approaches. By tokenizing images, text, and videos into a discrete space, BAAI has developed a single transformer from scratch.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, surpassing flagship models such as SDXL and LLaVA.

In September, BAAI open-sourced the key technologies and models of Emu3 including the chat model and generation model after supervised fine-tuning.

Emu3 has been receiving rave reviews from overseas developers. "For researchers, a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models. This approach is akin to the transformative impact of transformers in vision-related tasks," AI consultant Muhammad Umair said on social media platform Meta.

While next-token prediction is considered a promising path towards artificial general intelligence, it struggled to excel in multimodal tasks, which were dominated by diffusion models such as Stable Diffusion and compositional approaches like CLIP combined with large language models.

Raphael Mansuy, co-founder of QuantaLogic, an AI agent platform, thinks that Em3 has significant implications for Al development. Mansuy wrote on X that Em3's success suggests several key insights: Next-token prediction as a viable path to general multimodal Al; potential for simplified and more scalable model architectures; challenge to the dominance of diffusion and compositional approaches.

Editor:GONG Qian

Top News

Forging a Resilient Economy with Sci-tech Power

Tiangong Ultra, developed by the Beijing Humanoid Robot Innovation Center, won the world's first half-marathon for humanoid robots in Beijing on April 19, demonstrating the prospects of China's humanoid robot industry and the epitome of the country's strategic emerging industries and future industries. These industries are surging ahead, facilitating the construction of a resilient economy with sci-tech force.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網(wǎng)頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續(xù)瀏覽

繼續(xù)瀏覽
主站蜘蛛池模板: 一级做a爱片性色毛片www | 国产成人亚洲综合青青 | 中文字幕高清 | 国产高清av免费观看 | 色网站综合 | 国产精品成人永久在线 | av免费在线观看免费 | 久久亚洲精品无码观看不卡 | 简单av在线| 国精品产一区二区三区在线播放 | 91精品久久久久久综合 | 中文字幕亚洲欧洲 | 婷婷在线免费公开视频 | 国产精品嫩草影院免费 | 亚洲一区 国产一区 | 中文字幕在线播 | 亚洲第一页在线视频 | av黄色免费在线观看 | 三年片在线观看免费动漫 | 国产婷婷激情综合三区 | 91亚洲国产成人 | 久久久久97国产 | 精品国产一区二区三区四区在线观看 | 色视频免费观看 | 日韩欧一区二区三区 | 国产91综合一区在线观看 | 啊灬啊灬啊灬快灬高潮少妇 | 亚洲一区二区三区在线观看网站 | 在线看高清中文字幕一区 | 成人在线视频网站 | 亚洲乱码中文字幕综合区 | 久久久久国产精品熟女影院浪 | 国产人成精品香港三级在线 | 精品1区 | 国产一区三区在线播放 | 成人综合网久久久久久 | av免费提供 | 亚洲精品久久久久久久不卡四虎 | 色网站在线观看视频 | 99精品免费久久久久久久久日本 | 亚洲精品中文字幕乱码三区不卡 |