谷歌Nano Banana Pro升级企业级图像生成功能

内容来源:https://aibusiness.com/generative-ai/google-nano-banana-pro-image-gen-for-enterprises
内容总结:
谷歌发布新一代图像生成模型Nano Banana Pro,AI创意工具实现多维度升级
谷歌公司于11月20日正式推出新一代图像生成与编辑模型Nano Banana Pro。该模型基于Gemini 3 Pro架构构建,通过融合搜索引擎知识库与多模态推理能力,在信息可视化和创意设计领域实现显著突破。
此次升级主要体现在三大核心功能:首先,模型生成图像的准确性获得提升,能基于谷歌搜索知识库生成更符合语境的视觉内容;其次,突破性地解决了AI生成图像中文字易模糊的行业难题,确保图文融合的清晰度;第三,局部编辑功能支持用户对图像任意区域进行精细化调整,并实现多达14个图像元素的智能融合。此外,该模型还提供摄像机视角调节、焦点变更及色彩分级等专业级后期处理功能。
行业分析师布拉德利·希明指出,这项技术创新体现了"文学化编程"理念的延伸——将编程语言与文档语言相结合,构建涵盖代码、文本与视觉呈现的全流程创作体系。以影视导演的故事板创作为例,该模型不仅能生成分镜图,还可同步完成视频生成与脚本撰写,实现多模态创意任务的有机整合。
针对AI工具可能削弱人类批判性思维的担忧,希明以计算器类比强调:"它既是思维能力的延伸载体,也是加速引擎,关键在于使用者如何驾驭这种双重特性。"
目前,该模型已通过Gemini应用程序向普通用户开放,专业版和Ultra订阅用户享有更高额度权限。针对企业用户,谷歌将在广告平台、Workspace套件中的Slides和Vids组件集成该技术,开发者则可通过Gemini API和Google AI Studio进行调用。企业级用户还可基于Vertex AI生成式人工智能平台开展深度开发应用。
(注:本文提及的Nano Banana Pro为示例模型名称,实际产品信息请以官方发布为准)
中文翻译:
本文由谷歌云赞助
选择首个生成式AI应用场景
迈入生成式AI领域,首先应关注那些能够优化人类信息交互体验的环节。该模型具备局部编辑、可调拍摄角度、清晰文本处理等特性,旨在简化从原型设计到信息图制作的全流程创意工作。谷歌Nana Banana Pro的推出表明,图像生成AI模型不仅在画质上持续精进,更在情境关联性上取得突破。
11月20日,谷歌正式发布图像生成编辑模型Nano Banana Pro。该版本基于Gemini 3 Pro架构,借助Gemini的推理与知识体系,实现了从概念原型到信息图表的多维度视觉化创作。相较于前代产品,新模型通过整合谷歌搜索知识库,生成的图像精准度显著提升。特别在文字渲染方面取得突破性进展——过往图像中易模糊的文本如今可清晰呈现。谷歌透露,当前版本支持最多14张图像的融合创作,并升级了局部编辑功能,用户可对任意图像区域进行精准选择、优化与变形处理,同时实现拍摄视角调节、焦点切换及色彩分级等专业操作。
深耕图像生成领域的企业不止谷歌。Adobe Firefly支持用户在Photoshop和Illustrator中直接编辑生成图像,其他厂商亦掌握视角调控技术——例如Stability AI今年初推出的Stable Virtual Camera就实现了多视角视频生成与镜头控制。
创意生成的艺术
Futurum Group分析师布拉德利·希明指出,谷歌对Nano Banana Pro的升级契合"文学化编程"理念。这一由唐纳德·克努特于1984年提出的方法论,强调编程语言与文档语言的深度融合。希明表示,谷歌正通过模型效果图等技术手段,将这一理念注入Nano Banana的开发流程。
"这不仅是对软件运行机制的记录,更是借助图像进行迭代开发的全新范式。它构建了一个兼具部署与迭代能力的完整代理流程,使创意呈现不再局限于代码或标记文本,更延伸至视觉化表达层面。"该技术特别适用于导演的故事板创作,Nano Banana Pro既能生成分镜图,也可同步完成视频制作与剧本撰写。
希明强调:"在这个多模态技术蓬勃发展的时代,它将诸多差异显著又紧密关联的创意任务融会贯通。"
批判性思维考量
尽管Nano Banana等模型展现出强大创造力,有人担忧这可能削弱人类的批判性思维。但希明认为,此类工具与计算器等历史技术革新具有相似性。
"既可视为思维能力的延伸,也能当作认知效率的加速器。事实上它兼具双重属性——在放大和提速人类思维的同时,也可能弱化那些我们视为人类特质的核心能力。"
应用生态布局
目前普通用户和学生可通过Gemini应用使用Nano Banana Pro,Pro和Ultra订阅用户享有更高额度配额。专业用户将在谷歌广告平台体验升级后的图像生成服务,Workspace客户则可通过Google Slides和Vids调用该模型。针对开发者和企业用户,该模型已登陆Gemini API与Google AI Studio,Vertex AI生成式AI平台也即将开放企业级部署。
拓展阅读
英文来源:
Sponsored by Google Cloud
Choosing Your First Generative AI Use Cases
To get started with generative AI, first focus on areas that can improve human experiences with information.
With features such as localized editing, adjustable camera angles, and the ability to work with legible text, the model aims to ease various creative processes, ranging from prototyping to infographic design.
Google's Nana Banana Pro demonstrates that image-generating AI models are improving not only in generating images but also in grounding them in context.
Google on Nov. 20 introduced Nano Banana Pro, the latest version of its image generation and editing model. It is built on the Gemini 3 Pro model and uses Gemini's reasoning and knowledge capabilities to help visualize information and design images, from prototypes to infographics.
The model enables users to generate visuals using Google Search's knowledge base that are more accurate than produced by the previous generation of Nano Banana Pro. The model is also good at generating images with legible text; previously text in images could be illegible. Users can also blend more elements than previously -- now 14 images, Google said. There is also a capability that lets users select, refine and transform any part of an image with improved localized editing. Users can adjust camera angles, change focus and apply color grading, Google said.
Google is not the only vendor with an image-generating model that allows for localized editing. Adobe Firefly enables users to edit generated images directly within Photoshop and Adobe Illustrator.
Other vendors also have capabilities such as camera angle change. For example, earlier this year, Stability AI introduced Stable Virtual Camera, a multi-view video generation with camera control.
The Art of Generating Ideas
The updates Google made to Nano Banana Pro align with what is known as literate programming, said Bradley Shimmin, an analyst at Futurum Group. Introduced in 1984 by Donald Knuth, literate programming is a method of combining programming language with documentation language.
Google is applying that same idea to Nano Banana with techniques such as mockups, Shimmin said.
"To not just document how the software works but to iterate and build the software using these images to be able to create a fully agentic process of both the deployment and the iteration of an idea that incorporates more than just code or just markdown text, but also includes visual rendering of those ideas," he said.
One practical application of this process is for directors who are trying to storyboard an idea. Not only can Nano Banana Pro create a storyboard, but users can also use it to create a video and write the script that accompanies it.
"It's taking all these very different but highly interdependent creative tasks and bringing them together using this multimodal world that we now inhabit," Shimmin said.
Critical thinking
Because models like Nano Banana can be highly creative, there may be a perception that they hinder critical thinking.
However, for Shimmin, the tool is similar to other technologies that came before it, such as the calculator.
"You can view it as an offloading of mental faculty, you could view it as an accelerator of mental faculty, and in truth, it's both," he said. "It amplifies, accelerates, but also can dampen, and curtail those very aspects that we come to think of as being uniquely human."
Consumers and students can access Nano Banana Pro now on the Gemini app. Pro and Ultra subscribers have access to higher quotas when creating models. Professionals have access to Google Ads as Google will update the image generation there to Nano Banana Pro. It is also available to Workspace customers in Google Slides and Vids. The model is available in the Gemini API and Google AI Studio to developers and enterprises. Enterprises can also start building with it on the Vertex AI generative AI platform, with the model expected to be available soon in Gemini Enterprise.
You May Also Like
文章标题:谷歌Nano Banana Pro升级企业级图像生成功能
文章链接:https://blog.qimuai.cn/?post=2179
本站文章均为原创,未经授权请勿用于任何商业用途