谷歌Gemini Live人工智能助手将直观展示对话内容
内容来源:https://www.theverge.com/news/763114/google-gemini-live-ai-visual-guidance-speech-update
内容总结:
谷歌旗下人工智能助手Gemini Live即将迎来多项重磅升级。本周起,该助手在共享摄像头画面时可实现实时屏幕标注功能,用户通过手机摄像头对准物品(如维修工具组)即可在屏幕上获得精准视觉指引。此项功能将率先于8月28日上市的Pixel 10系列机型上线,同期向安卓设备推送,iOS版本将于数周后跟进。
本次升级还深度整合了系统级应用生态。用户在获取导航服务时可直接语音中断当前对话,例如指令"路线可行,现在给Alex发消息说我迟到10分钟",系统即会自动生成短信草稿。该功能未来将覆盖短信、电话、时钟等核心应用。
值得注意的是,谷歌同步推出了新一代音频模型,通过模拟人类语言的语调、节奏与音高变化实现更自然的对话交互。新版本能根据谈话内容自动调整语气(如应对压力话题时采用舒缓声线),支持语速自定义,并在角色扮演叙事中运用口音特效增强故事表现力。这些升级与OpenAI的ChatGPT语音定制功能形成直接对标。
(注:原文中涉及索尼PS5涨价、华硕显示器等无关内容已按指令要求过滤)
中文翻译:
谷歌正为其人工智能助手Gemini Live推出一系列新功能,用户可实现与AI的实时对话。下周起,Gemini Live将在共享摄像头画面时支持直接屏幕标注功能,使AI助手能更精准地指示特定对象。
例如当您需要为某个项目挑选合适工具时,只需将智能手机摄像头对准工具组,Gemini Live便会在屏幕上高亮标出正确选项。该功能将随8月28日新发布的Pixel 10系列设备首发上线,同期开始向其他安卓设备推送视觉指引功能,并在"未来数周内"逐步扩展至iOS平台。
谷歌还推出了新的系统集成方案,即将实现Gemini Live与信息、电话、时钟等更多应用的联动。假设您正在与Gemini讨论出行路线时意识到可能迟到,只需打断对话并指示:"这条路线不错,现在给Alex发信息说我大约晚到十分钟",系统即可自动生成待发送的文本内容。
最后,谷歌为Gemini Live升级了音频模型,宣称能"显著提升AI运用语调、节奏、音高等人类语音关键要素的能力"。很快Gemini将能根据谈话内容自动调整语气,例如在讨论压力性话题时采用更舒缓的声线。
用户还可调节Gemini的语速快慢——这与当前ChatGPT语音模式的风格调节功能类似。当要求Gemini以特定角色或历史人物视角戏剧化重述故事时,AI甚至能模仿相应口音以营造"生动迷人的叙事效果"。
其他热门资讯:
- 索尼明日起全面上调PS5售价
- 谷歌Pixel 10/10 Pro搭载磁吸模块、全新芯片及全域AI功能
- 谷歌新款Pixel全系产品售价遭泄露
- 华硕推出720Hz全球最快OLED电竞显示器,设计感十足
- 家庭版Gemini成为谷歌近年最大智能家居战略举措
英文来源:
Google is bringing a bundle of new features to Gemini Live, its AI assistant that you can have real-time conversations with. Next week, Gemini Live will be able to highlight things directly on your screen while sharing your camera, making it easier for the AI assistant to point out a specific item.
If you’re trying to find the right tool for a project, for example, you can point your smartphone’s camera at a collection of tools, and Gemini Live will highlight the correct one on your screen. This feature will be available on the newly announced Pixel 10 devices when they launch on August 28th. Google will begin rolling out visual guidance to other Android devices at the same time before expanding to iOS “in the coming weeks.”
Google is also launching new integrations that will soon allow Gemini Live to interact with more apps, including Messages, Phone, and Clock. Say you’re in the middle of a conversation with Gemini about directions to your destination, but you realize you’re running late. Google says you’ll be able to interrupt the chatbot with something like: “This route looks good. Now, send a message to Alex that I’m running about 10 minutes late.” From there, Google can draft a text to your friend for you.
Lastly, Google is launching an updated audio model for Gemini Live that the company says will “dramatically improve” how the chatbot “uses the key elements of human speech, like intonation, rhythm and pitch.” Soon, Gemini will change its tone based on what you’re speaking about, such as using a calmer voice if you’re asking about a stressful topic.
You’ll also be able to change how fast — or slow — Gemini talks, which sounds a bit similar to how users can now tweak the style of ChatGPT’s voice mode. And, if you ask Gemini for a dramatic retelling of a story from the perspective of a particular character or historical figure, the chatbot may adopt an accent for a “rich, engaging narrative.”
Most Popular
- Sony is raising PS5 prices, starting tomorrow
- The Google Pixel 10 and 10 Pro come with magnets, a new chip, and AI everywhere
- Prices leak for the rest of Google’s new Pixel products
- Asus has the new world’s fastest OLED monitor at 720Hz, and it’s dripping with style
- Gemini for Home is Google’s biggest smart home play in years
文章标题:谷歌Gemini Live人工智能助手将直观展示对话内容
文章链接:https://www.qimuai.cn/?post=175
本站文章均为原创,未经授权请勿用于任何商业用途