31.05.2025 09:03

ByteDance Unveils BAGEL: A GPT-4o-Level Image Generator for Local Installation

News image

Chinese tech giant ByteDance has announced the release of BAGEL, a powerful multimodal AI model that rivals the capabilities of GPT-4o and Gemini 2.0, now available as a free demo and detailed on its official site.

BAGEL not only generates and edits images but also analyzes charts and explains photographs while preserving the styles of renowned artworks, making it a versatile tool for creatives and researchers alike.

With 7B active parameters (14B total), BAGEL leverages a Mixture-of-Transformer-Experts (MoT) architecture, trained on vast multimodal datasets. This enables it to perform complex tasks like free-form visual manipulation, chart analysis, and style-consistent image generation, positioning it as a strong competitor to leading proprietary models.

For those interested in running BAGEL locally, ByteDance has made the model accessible through HuggingFace and GitHub. Users can install it by cloning the repository, setting up a Python environment, and downloading the model weights, as outlined in the GitHub instructions.


Also read:

This open-source approach empowers users to harness GPT-4o-level image generation and editing capabilities on their own hardware, democratizing access to cutting-edge AI technology.


0 comments
Read more