Skip to main content

Cookie Consent

We use cookies to enhance your browsing experience, serve personalised ads or content, and analyse our traffic. Learn more

Install AIinASIA

Get quick access from your home screen

AI in ASIA
Qwen AI image generation
News

Qwen launches to take on Google's Nano Banana

Qwen's here to challenge Google's Nano Banana! See how this new contender plans to shake up AI image generation and what it means for you. Read on!

Intelligence Desk6 min read

AI Snapshot

The TL;DR: what matters, fast.

Google's Nano Banana Pro image model set new standards for AI image generation last November, excelling at creating complex, text-heavy visuals.

However, this model is proprietary and integrated into Google's cloud, limiting flexibility and increasing costs for some businesses.

Alibaba's Qwen-Image-2512 now offers an open-source alternative, freely available for commercial use and accessible via various platforms and APIs.

Who should pay attention: AI developers | Enterprises | Open-source advocates

What changes next: Competition in the AI image generation space is likely to escalate.

When Google unveiled its Nano Banana Pro image model, also known as Gemini 3 Pro Image, last November, it significantly reshaped expectations for AI image generation.

This breakthrough allowed users to create complex, text-heavy visuals such as infographics and slides using natural language, largely free from spelling errors.

However, this advance came with a familiar trade-off: Gemini 3 Pro Image is highly proprietary, deeply integrated into Google's cloud infrastructure, and priced for premium use. For businesses requiring predictable costs, deployment autonomy, or regional specialisation, this model set a new benchmark but offered few flexible alternatives.

Now, Alibaba's Qwen AI research team, following a successful year of robust open-source AI model releases, has introduced its own solution: Qwen-Image-2512.

This model is freely available to developers and even large enterprises for commercial applications under the permissive Apache 2.0 license.

Users can access the model directly through Qwen Chat. Its full open-source weights are available on Hugging Face or ModelScope, and the source code can be inspected or integrated from GitHub.

For those preferring zero-install experimentation, the Qwen team offers hosted demos on Hugging Face and ModelScope. Enterprises needing managed inference can also tap into these generation capabilities via Alibaba Cloud’s Model Studio API.

Responding to Enterprise Needs

The impact of Gemini 3 Pro Image was considerable. Its capacity to generate production-ready diagrams, slides, and multilingual visuals propelled image generation beyond creative experimentation and into core enterprise infrastructure. This development aligns with broader discussions around AI orchestration, data pipelines, and security. In this context, image models are evolving from artistic tools into essential workflow components, expected to integrate seamlessly into documentation, design, marketing, and training platforms with consistent performance and control.

Many responses to Google's offering have been proprietary, featuring API-only access, usage-based pricing, and tight platform integration, similar to OpenAI's GPT Image 1.5 released recently. Qwen-Image-2512, though, adopts a different philosophy. It posits that performance parity combined with open access is precisely what a significant portion of the enterprise market desires.

Key Improvements in Qwen-Image-2512

The December 2512 update focuses on three critical areas for enterprise image generation:

  • Human realism and environmental coherence: Qwen-Image-2512 markedly reduces the "AI look" often seen in open models. Facial features exhibit more accurate age and texture, postures align better with prompts, and background environments are rendered with improved semantic context. This realism is vital for businesses using synthetic imagery in training, simulations, or internal communications, enhancing credibility.
  • Natural texture fidelity: Landscapes, water, animal fur, and various materials are rendered with finer detail and smoother gradients. These enhancements are not merely aesthetic; they enable the creation of synthetic imagery for e-commerce, education, and visualisation without extensive manual post-processing.
  • Structured text and layout rendering: Qwen-Image-2512 boasts improved embedded text accuracy and layout consistency, supporting both Chinese and English prompts. Slides, posters, infographics, and mixed text-image compositions are more legible and adhere more closely to instructions. This is an area where Gemini 3 Pro Image received considerable praise, and where many earlier open models struggled.

In blind, human-evaluated tests conducted on Alibaba’s AI Arena, Qwen-Image-2512 emerged as the strongest open-source image model, remaining competitive even with closed systems.

This reinforces its position as a viable, production-ready option rather than merely a research preview. For more insights into how AI models are evolving, you might find our article on OpenAI says human adoption not new models is the key to achieving AGI insightful.

The Open-Source Advantage for Deployment

Qwen-Image-2512's primary differentiator is its licensing. Released under Apache 2.0, the model can be freely used, modified, fine-tuned, and deployed commercially. This offers enterprises several advantages that proprietary models cannot match:

  • Cost control: At scale, per-image API pricing can quickly become prohibitive. Self-hosting allows organisations to amortise infrastructure costs rather than incurring perpetual usage fees.
  • Data governance: Regulated sectors often demand stringent control over data residency, logging, and auditability.
  • Localisation and customisation: Teams can adapt models for regional languages, cultural norms, or internal style guides without relying on a vendor's roadmap.

In contrast, while Gemini 3 Pro Image provides strong governance assurances, it remains intrinsically linked to Google’s infrastructure and pricing model.

API Pricing for Managed Deployments

For teams preferring managed inference, Qwen-Image-2512 is available via Alibaba Cloud Model Studio as qwen-image-max, priced at $0.075 per generated image. The API accepts text input and returns image output, with rate limits suitable for production workloads. There are limited free quotas, after which usage transitions to paid billing. This hybrid approach, combining open weights with a commercial API, reflects how many enterprises currently deploy AI: internal experimentation and customisation, supplemented by managed services where operational simplicity is paramount. This strategy is also evident in other AI tools like ChatGPT Now Creates Sharper Images, Quicker.

A Competitive, Yet Philosophically Distinct, Offering

Qwen-Image-2512 isn't positioned as a direct, universal replacement for Gemini 3 Pro Image. Google’s model benefits from deep integration with Vertex AI, Workspace, Ads, and the broader Gemini reasoning stack. For organisations already invested in Google Cloud, Nano Banana Pro fits naturally into existing workflows.

Qwen’s strategy is more modular. The model integrates cleanly with open tooling and custom orchestration layers, making it appealing to teams building their own AI stacks or combining image generation with internal data systems. This approach aligns with the growing trend of customisation, as seen with tools like Customise ChatGPT's tone: warmth, enthusiasm, structure.

A Clear Market Signal

The launch of Qwen-Image-2512 underscores a significant shift: open-source AI is no longer merely playing catch-up with proprietary systems. Instead, it's selectively matching the capabilities most crucial for enterprise deployment, including text fidelity, layout control, and realism.

Simultaneously, it preserves the freedoms that businesses increasingly value, such as control over their data and infrastructure. A recent report by the National Academies of Sciences, Engineering, and Medicine highlights the growing importance of open-source models in advancing AI research and deployment, particularly for fostering innovation and addressing ethical considerations National Academies Press.

Google’s Gemini 3 Pro Image certainly raised the bar. Qwen-Image-2512, however, demonstrates that enterprises now have a robust open-source alternative, one that effectively balances performance with cost control, governance, and deployment flexibility.

What are your thoughts on the increasing competition between proprietary and open-source AI models? Share your perspective in the comments below.

YOUR TAKE

We cover the story. You tell us what it means on the ground.

What did you think?

Written by

Share your thoughts

Be the first to share your perspective on this story

This is a developing story

We're tracking this across Asia-Pacific and may update with new developments, follow-ups and regional context.

Guides & Tutorials

Master AI tools with step-by-step learning resources

View All Guides
Semiconductor wafer with Taiwan tech industry facilities, circuit design patterns visible

AI for Taiwan's Semiconductor and Tech Industry Professionals

Master AI applications specifically for semiconductor manufacturing, design, and engineering in Taiwan's world-leading tech industry

intermediate
AI in Malaysia: Your Guide to Malaysia's Growing AI Ecosystem - AI in Asia guide

AI in Malaysia: Your Guide to Malaysia's Growing AI Ecosystem

Discover Malaysia's fast-growing AI ecosystem. From the National AI Strategy to homegrown startups and multilingual AI challenges, learn how Malaysia is positioning itself as Southeast Asia's AI hub.

beginner
Marketing analytics dashboard with Taiwan social media platforms, audience data, and campaign metrics

AI-Powered Marketing for Taiwan's Unique Digital Landscape

Leverage AI to create marketing campaigns that resonate authentically with Taiwan audiences across all major digital platforms

intermediate
Taiwan creative workspace with design tools, music production setup, and media creation equipment

AI and Taiwan's Creative Economy: Design, Music and Media

Leverage AI tools to amplify your creative career in Taiwan's dynamic design, music, and media ecosystem

intermediate
Person studying Mandarin Chinese with Traditional characters, Taiwan cultural artifacts visible

AI Tools for Learning Traditional Chinese and Taiwanese Culture

Accelerate your Mandarin learning and cultural understanding with AI tutors customised to Taiwan's language, history, and living culture

beginner
Taiwan 7-Eleven storefront, MRT station, payment technology and digital convenience services

Everyday AI for Life in Taiwan: From 7-Eleven to MRT

Master Taiwan's AI-powered everyday conveniences - from smart shopping to seamless transport - and live more efficiently in Taiwan's tech ecosystem

beginner

Liked this? There's more.

Join our weekly newsletter for the latest AI news, tools, and insights from across Asia. Free, no spam, unsubscribe anytime.

Loading comments...