Google has announced the reconstruction and renaming of its Bard chatbot to Gemini, offering the industry’s first multimodal generative AI (genAI) platform that goes beyond text to provide human-like responses. This release represents a direct challenge to Microsoft’s Copilot and other chatbots based only on large language model (LLM) technology.
Gemini is the only native multimodal generative AI model available, according to Chirag Dekate, a vice president analyst at Gartner. This is a significant development as it allows a single genAI engine to perform individual tasks more accurately by learning from a larger body of knowledge, positioning Google at the forefront of the genAI pack.
Google first unveiled its Gemini AI model in December, touting multimodal capabilities that allow it to combine different types of information, including text, code, audio, images, and video. Unlike LLM-only AI engines, Gemini doesn’t rely on the same technology and can be trained using all types of media and content.
Enterprises can now create a chatbot that is no longer confined to loading answers to queries from text on which its LLM has been trained. This opens up new possibilities for creating a more immersive chatbot experience, particularly in healthcare, where a multimodal genAI engine can ingest various types of data to create more accurate prognoses and treatment outcomes.
Spending on genAI solutions is expected to reach $40 billion in 2024, up from $19.4 billion in 2023.
2024-02-13 09:00:04
Post from www.computerworld.com