Google paper about Video Classification using Neural Networks.
paper: https://plus.google.com/+ResearchatGoogle/posts/eqSPSviY2CH
Look at the video below, showing which sport the algorithm predicted frame-by-frame.
I wonder how long it took to classify all the frames. Imagine if it were possible to do this in real time...
PS: I got an answer from Andrej Karpathy (thanks Andrej):
"inference is embarrassingly parallel process so this video could be done almost instantly given enough CPUs on cluster as done in this work. On modern GPUs, CNNs like this run at about 2ms/frame, and since 72 seconds = ~2160 frames you'd expect somewhere around 5 seconds for this video."
source: https://plus.google.com/+ResearchatGoogle/posts/eqSPSviY2CH
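As a rough check of the arithmetic in Andrej's answer, here is a minimal sketch in Python. The 30 fps frame rate is my assumption (it is what "72 seconds = ~2160 frames" implies), and the function name and the `workers` parameter are hypothetical. The `workers` parameter assumes ideal linear scaling across machines, which a real cluster would only approximate.

```python
def estimate_inference_seconds(duration_s, fps, ms_per_frame, workers=1):
    """Estimate the time needed to classify every frame of a video.

    Assumes each frame is classified independently (embarrassingly
    parallel) and that work divides evenly across `workers`
    (an idealized assumption, not a measured result).
    """
    frames = duration_s * fps                 # total number of frames
    total_ms = frames * ms_per_frame          # sequential cost in milliseconds
    return total_ms / 1000.0 / workers        # convert to seconds, split across workers


# Figures from the quote: 72-second video, about 2 ms per frame on a GPU.
# The 30 fps value is an assumption inferred from "~2160 frames".
video_seconds = 72
total = estimate_inference_seconds(video_seconds, fps=30, ms_per_frame=2.0)
realtime_factor = video_seconds / total       # > 1 means faster than real time

print(f"estimated time: {total:.2f} s")
print(f"speed relative to real time: {realtime_factor:.1f}x")
```

Under these assumptions a single GPU would need about 4.3 seconds for the 72-second clip, which matches the "around 5 seconds" in the quote and is well ahead of real time. Decoding the video and moving frames to the GPU are not counted here, so an actual run would likely be slower.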