ChatGPT has been a game-changer in AI, giving everyday users a direct experience of natural language processing. Microsoft has now taken things a step further by introducing its all-around artificial intelligence model, AI-Kosmos-1. While ChatGPT is a text-only large language model (LLM), AI-Kosmos-1 is a multimodal large language model (MLLM) that can process images as well as text. It is aimed at tasks that call for human-like reasoning, including visual puzzle-solving, visual text recognition, and understanding natural language commands.
The capabilities of AI-Kosmos-1 are impressive, though still at an early stage. The researchers noted in their academic paper that “multimodal perception is necessary for the realization of artificial intelligence.” The vision examples in the paper illustrate the model’s ability to analyze images, answer questions about them, read text from images, and write captions for photos. It can also take visual IQ tests, on which it reaches an accuracy of 22-26%.
The launch of AI-Kosmos-1 marks a significant milestone in the development of artificial intelligence. With its ability to process a range of inputs, it has the potential to revolutionize the way we interact with machines and the tasks they can perform.
In conclusion, while ChatGPT has already demonstrated the power of AI, AI-Kosmos-1 takes things to the next level. As we continue to explore the possibilities of artificial intelligence, it will be exciting to see what other breakthroughs are on the horizon.