Unleashing the Power of Gemini: Google’s Revolutionary Multimodal genAI Engine

By ad-astra
11 months Ago

Unleashing the Power of Gemini: Google’s Revolutionary Multimodal genAI Engine

Google has announced the ⁤reconstruction and renaming of its Bard chatbot to⁤ Gemini, offering⁢ the industry’s first multimodal generative AI (genAI) platform‍ that goes beyond text to provide human-like⁢ responses. ⁤This release represents a direct challenge to Microsoft’s Copilot and ‍other chatbots ‍based only ‍on large language model (LLM)‍ technology.

Gemini is the only native ⁢multimodal generative AI‌ model available, according to Chirag‍ Dekate, a vice⁣ president analyst at Gartner. This⁤ is a ‌significant development as it allows a single genAI engine to perform individual tasks ‍more accurately by learning from a larger body of knowledge, ‍positioning Google ‍at the forefront of the genAI pack.

Google first unveiled its Gemini AI model in December, touting multimodal capabilities that allow it to combine different‍ types ⁤of information, including text, code, audio, images, and video. ⁣Unlike LLM-only AI engines, Gemini doesn’t rely on ⁢the ⁤same technology and⁣ can be trained using all types of media and content.

Enterprises can now create⁤ a chatbot that is no longer confined to loading ⁤answers to queries from text on which its LLM has been trained. ⁣This opens up new⁤ possibilities for‌ creating⁢ a more immersive chatbot experience, ‍particularly in healthcare, where a multimodal genAI engine can ⁢ingest various types of data to⁣ create more ‍accurate prognoses and treatment outcomes.

Spending on genAI solutions is expected to reach⁣ $40 billion in 2024, up from $19.4 billion in 2023.

2024-02-13 09:00:04
Post from www.computerworld.com

Categories: Tech

Related Content

Headline