Google Launches Gemini 3-Backed Image Generation Model Nano Banana Pro

Nano Banana Pro delivers what Google describes as "studio-quality levels of precision and control" through enhanced planning of text placement, font characteristics, and spatial relationships.

TMTPOST -- Google's latest artificial intelligence (AI) image model addresses longstanding industry challenges in text accuracy and professional-grade editing capabilities. The model is powered by the tech giant's most advanced reasoning system, and its rollout signals intensifying competition with OpenAI as both companies race to monetize generative AI technology.

The search giant unveiled Nano Banana Pro on Thursday, built on the Gemini 3 model released just two days earlier. The new image generation and editing tool offers enhanced text rendering, higher resolutions up to 4K, and expanded control over visual elements—features designed to appeal to both consumer users and professional designers. Google said the update addresses AI's persistent "spelling problem," where models frequently produce distorted text or typos in generated images.

Nano Banana Pro is immediately available across Google's product ecosystem, with free Gemini users receiving limited quotas before reverting to the older model. Paid subscribers to AI Pro and Ultra plans will access higher generation limits. The model integrates with popular design platforms including Canva, Figma, and Adobe Inc.'s Firefly and Photoshop, positioning Google's technology within existing professional workflows.

The announcement follows strong market reception to Tuesday's Gemini 3 launch, which pushed Alphabet shares to record highs on Wednesday. Google's Gemini app currently has over 650 million monthly active users, though it trails OpenAI's ChatGPT, which reported 800 million weekly users in October. The competitive dynamic intensified last week when OpenAI updated its GPT-5 model for more conversational interactions.

Studio-Quality Control for Professional Applications

Nano Banana Pro delivers what Google describes as "studio-quality levels of precision and control" through enhanced planning of text placement, font characteristics, and spatial relationships. A Google spokesperson said the model maps these elements before rendering the final image, enabling applications from transforming recipes into illustrated flowcharts to visualizing real-time data like weather or sports information.

The tool supports generation at 1K, 2K, or 4K resolution, addressing a previous limitation in which models were capped at 1024 x 1024 pixels. Users can specify camera angles, depth of field, color grading, and aspect ratios through text prompts, mimicking professional photography controls. The model can maintain consistency of up to five characters and incorporate up to 14 reference objects in a single workflow, capabilities particularly relevant for brands developing marketing campaigns with specific design assets.

Josh Woodward, vice president of Google Labs and Gemini, told CNBC that internal users have experimented with inputting code snippets and LinkedIn resumes to create infographics. "This ability to visualize things that were previously maybe not something you would think of as a visual medium tends to be one of the magic things people are finding with it," he said.

The original Nano Banana, launched in late August, added 13 million new Gemini app users in four days after going viral for creating hyperrealistic 3D figurines from photos. Nano Banana Pro extends those capabilities with multi-language text generation, translation of text within images, and the ability to generate localized content for international markets.

Pricing Structure and Platform Integration

Google disclosed that Nano Banana Pro costs $0.139 for each 1080p or 2K image and $0.24 for every 4K image, compared to $0.039 per 1024px image for the original model. The company acknowledged the new model is slower and more expensive but emphasized that the enhanced quality justifies the premium for professional applications.
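
To put those per-image prices in perspective, the short Python sketch below estimates what a batch of images would cost at each tier and how each Pro tier compares with the original model. It uses only the figures quoted above. The tier labels and function names are hypothetical, chosen for illustration, and are not part of any Google API. Actual billing may differ from this simple per-image multiplication.

```python
from decimal import Decimal

# Per-image prices in USD, as reported in this article.
# The dictionary keys are illustrative labels, not official tier names.
PRICE_PER_IMAGE = {
    "original_1024px": Decimal("0.039"),
    "pro_1080p_or_2k": Decimal("0.139"),
    "pro_4k": Decimal("0.24"),
}


def batch_cost(tier: str, image_count: int) -> Decimal:
    """Return the estimated cost in USD of generating image_count images."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    return PRICE_PER_IMAGE[tier] * image_count


def price_multiple(tier: str, baseline: str = "original_1024px") -> Decimal:
    """Return how many times more a tier costs per image than the baseline."""
    ratio = PRICE_PER_IMAGE[tier] / PRICE_PER_IMAGE[baseline]
    return ratio.quantize(Decimal("0.01"))


if __name__ == "__main__":
    for tier in PRICE_PER_IMAGE:
        cost = batch_cost(tier, 1000)
        multiple = price_multiple(tier)
        print(f"{tier}: ${cost:.2f} per 1,000 images, {multiple}x the original")
```

At these prices, 1,000 images would cost $39 with the original model, $139 at the Pro 1080p or 2K tier, and $240 at 4K, which is roughly 3.6 and 6.2 times the original model's per-image price.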

The model is accessible through the Gemini API, Google AI Studio, and the company's new IDE, Antigravity. Workspace customers can use it within Google Slides and Vids. Ultra subscribers will gain access in Flow, Google's AI filmmaking tool. The widespread integration reflects Google's strategy to embed AI capabilities across its product portfolio rather than offering standalone tools.

Developers and enterprise users can tap Nano Banana Pro immediately, while consumer access varies by subscription tier. Woodward said demand for Gemini's subscription plans has grown as users seek "higher limits with some of these advanced models," adding that high demand represents "the best problem to have."

Limitations and Watermarking Safeguards

Google cautioned that Nano Banana Pro retains limitations despite improvements. The company said users should verify data-driven outputs, as the model's real-world knowledge "is extensive but not infallible" and may produce factually incorrect results in infographics or annotated diagrams. Advanced features like masked editing, dramatic lighting changes, or blending multiple images may sometimes yield unnatural results or visual artifacts.

The model can struggle with grammar, spelling, cultural nuances, and idiomatic phrases in non-English languages. Small faces, accurate spelling, and fine details in images remain challenging, and character consistency, while improved, is not yet fully reliable, Google acknowledged.

As part of Thursday's announcement, Google embedded SynthID technology in the Gemini app, allowing users to upload images and determine if they were generated by Google AI. The company plans to expand this capability to audio and video. Google embeds imperceptible digital watermarks on all AI-generated media, with visible watermarks on images created by free and Pro tier users. Ultra subscribers receive content without visible watermarks.

The company did not indicate whether it will support other watermarking standards such as C2PA, which has gained industry backing for authenticating digital content provenance.
