Entry Date:
April 8, 2019

Understanding Real-World Actions as They Unfold


The brain has a remarkable ability to size up a scene and quickly understand what’s going on. MIT-IBM researchers are training machines to do something similar with a dataset of 1 million short video clips called Moments in Time. The models learn to recognize what’s happening in any particular frame, whether that’s pandas playing or robots dancing or a poodle jumping for joy. As AI systems learn to understand the gist of dynamic scenes, the hope is that this knowledge can be transferred to other domains.