Jointly learning video, audio, and text to recall digital memories (Facebook AI)

"Smartphone cameras have made it simple to take photos and videos on the fly. In the future, wearables such as AR glasses will make it even easier to capture things, hands-free. As this becomes the norm, people should be able to recall specific moments from their vast bank of digital memories just as easily as they capture them. It’ll be valuable to build smarter AI systems that can understand what’s happening in videos on a more granular level."

#ML #Augmentation