May 8 – Google is reportedly working on expanding its Gemini Live feature to perform tasks within apps, according to a new report. The tech giant is said to be developing a way for Gemini Live to interact with apps directly, potentially eliminating the need for user intervention.
Gemini Live, a two-way, real-time voice conversation tool, allows users to verbally ask questions and receive responses from an artificial intelligence (AI) in a human-like manner. Recently, Google introduced an update that enables Gemini Live to access a device’s camera, providing real-time answers to queries about the user’s surroundings.
The new app integration feature was uncovered by Android Authority during an APK teardown of the Google app for Android (version 16.17.38.sa.arm64). The report noted the presence of the code string “Extensions_on_Live_Phase_One,” which suggests the feature’s early development. Although Google has now switched to using the term “apps” in its branding for Gemini, the internal use of “extensions” may still be present.
While the details are still unclear, the code indicates that Gemini Live may soon be able to connect with a variety of apps, although it remains uncertain whether this will apply to first-party apps, third-party apps, or both. The term “Phase One” suggests that this integration could be rolled out in stages, following a similar approach taken with Gemini AI, which was gradually integrated with both first-party and third-party apps.
Speculation surrounding the new feature aligns with an email reportedly sent to Gemini Advanced users, which referenced potential app integration.
Google has not yet commented on the report or provided further details on the rollout timeline of this new functionality. However, the development is seen as a significant expansion of the capabilities of Gemini Live, which aims to make interactions with AI more seamless and efficient.
As Google continues to innovate in the AI space, the potential integration of Gemini Live with apps could further enhance the functionality of its assistant, making it more versatile in answering user queries and performing tasks across multiple platforms.
4o mini