by Lars Erik Holmquist, Activist Post:
Google has launched Gemini, a new artificial intelligence (AI) system that can seemingly understand and talk intelligently about almost any kind of prompt – pictures, text, speech, music, computer code and much more.
This type of AI system is known as a multimodal model. It’s a step beyond just being able to handle text or images as previous ones have. And it provides a strong hint of where AI may be going next: being able to analyse and respond to real-time information coming from the outside world.
TRUTH LIVES on at https://sgtreport.tv/
Although Gemini’s capabilities might not be quite as advanced as they seemed in a viral video, which was edited from carefully curated text and still image prompts, it is clear that AI systems are rapidly advancing. They are heading towards an ability to handle more and more complex inputs and outputs.
To develop new capabilities, AI systems are highly dependent on the kind of “training” data they have access to. They are exposed to this data to help them improve at what they do, including making inferences such as recognising a face in a picture, or writing an essay.
At the moment, the data that companies such as Google, OpenAI, Meta and others train their models on are still mainly harvested from digitised information on the internet. However, there are efforts to radically expand the scope of the data that AI can work on. For example, by using always-on cameras, microphones and other sensors, it would be possible to let an AI know what is going on in the world as it happens.
Real-time data
Google’s new Gemini system has shown that it can understand real-time content such as live video and human speech. With new data and sensors, AI will be able to observe, discuss and act upon occurrences in the real world.
The most obvious example of this is with self-driving cars, which already collect enormous amounts of data as they are driving on our roads. This information ends up on the manufacturers’ servers where it is used not just in the moment of operating the vehicle, but to build long term computer-based models of driving situations that can support better traffic flow, or help authorities identify suspicious or criminal behaviour.
In the home, motion sensors, voice assistants and security cameras are already used to detect activity and pick up on our habits. Other “smart” appliances are appearing on the market all the time. While early uses for this are familiar, such as optimising heating for better energy usage, the understanding of habits will get much more advanced.
This means that an AI can both infer activities in the home, and even predict what will happen in the future. This data could then be used, for instance, by doctors to detect early onsets of ailments such as diabetes or dementia, as well as to recommend and follow up on changes in lifestyle.