Imagine watching a favorite movie when suddenly the sound stops. The data representing the audio is missing. All that's left are images. What if artificial intelligence (AI) could analyze each frame of the video and provide the audio automatically based on the pictures, reading lips and noting each time a foot hits the ground?