Google’s teaching AI to ‘see’ and ‘hear’ at the same time

A team of scientists from Google Research, the Alan Turing Institute, and Cambridge University recently unveiled a new state-of-the-art (SOTA) multimodal transformer for AI. In other words, they’re teaching AI how to ‘hear’ and ‘see’ at the same time.

Up front: You’ve probably heard about transformer AI systems such as GPT-3. At their core, they process and categorize data from a specific kind of media stream. Under the current SOTA paradigm, if you wanted to parse the data from a video, you’d need several AI models running concurrently. You’d need a model that’s been trained on videos…

This story continues at The Next Web


from The Next Web https://ift.tt/3d7V5Jw

