
# DeepDreamVideo

Implementing #deepdream on video

Included experiment: Deep Dreaming Fear & Loathing in Las Vegas: the Great San Francisco Acid Wave

The results can be seen on YouTube: https://www.youtube.com/watch?v=oyxSerkkP4o

An mp4 not yet destroyed by YouTube compression is also available in a mega.nz folder, together with the original video file.

All individual processed and unprocessed frames are also on GitHub.


## Install Dependencies

A good overview (constantly being updated) of which software libraries to install, plus a list of web resources and how-tos, is on reddit: https://www.reddit.com/r/deepdream/comments/3cawxb/what_are_deepdream_images_how_do_i_make_my_own/

## Usage

Extract 25 frames a second from the source movie

./1_movie2frames.sh [ffmpeg|avconv] [movie.mp4] [directory]
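The extraction script is essentially a thin wrapper around an ffmpeg call. A minimal sketch of that step in Python (assuming ffmpeg is on the PATH and jpg output; the actual script may use different flags or avconv):

```python
import os
import subprocess

def extract_frames(movie, out_dir, fps=25):
    """Dump the movie to numbered jpg frames at the given frame rate."""
    if not os.path.exists(out_dir):
        os.makedirs(out_dir)
    subprocess.check_call([
        "ffmpeg",
        "-i", movie,                        # source movie
        "-r", str(fps),                     # resample to 25 frames per second
        "-qscale:v", "2",                   # high jpg quality
        os.path.join(out_dir, "%08d.jpg"),  # frames/00000001.jpg, ...
    ])

extract_frames("movie.mp4", "frames")
```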

Let a pretrained deep neural network dream on the frames, one by one, taking each new frame and blending in 0-50% of the previous (already dreamed) frame for continuity of the hallucinated artifacts, and go drink your caffe:

python 2_dreaming_time.py -i frames -o processed
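The continuity trick is a plain alpha blend of the previous dreamed frame into the next raw frame before the dream step runs. A minimal sketch with numpy and PIL (the function name and the 50/50 split are illustrative, not the script's actual API):

```python
import numpy as np
import PIL.Image

def blend_with_previous(raw_frame_path, prev_dreamed_path, blend=0.5):
    """Mix `blend` of the new raw frame with (1 - blend) of the previous
    dreamed frame, so hallucinated artifacts carry over between frames."""
    raw = np.float32(PIL.Image.open(raw_frame_path))
    prev = np.float32(PIL.Image.open(prev_dreamed_path))
    mixed = blend * raw + (1.0 - blend) * prev
    return PIL.Image.fromarray(np.uint8(np.clip(mixed, 0, 255)))

# e.g. a 50/50 mix before dreaming on the second frame:
# blend_with_previous("frames/00000002.jpg", "processed/00000001.jpg", 0.5)
```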

Different models can be loaded with:

python 2_dreaming_time.py -i frames -o processed --model_path ../caffe/models/Places205-CNN/ --model_name Places205.caffemodel

(again, eat your heart out: not a free lunch, but free models are available from the Caffe Model Zoo linked below)
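Under the hood this just points Caffe at a different prototxt/caffemodel pair. A rough sketch in the style of the original deepdream notebook (the deploy.prototxt file name and the ImageNet-style mean are assumptions; check the model you downloaded):

```python
import numpy as np
import caffe

model_path = "../caffe/models/Places205-CNN/"   # directory holding the model files
net_fn = model_path + "deploy.prototxt"         # assumed deploy file name
param_fn = model_path + "Places205.caffemodel"  # downloaded weights

# BGR mean and channel swap as in the reference deepdream notebook
net = caffe.Classifier(net_fn, param_fn,
                       mean=np.float32([104.0, 116.0, 122.0]),
                       channel_swap=(2, 1, 0))
```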

You can also stick to one specific layer:

python 2_dreaming_time.py -i frames -o processed -l inception_4c/output
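The layer name picks which blob's activations the gradient ascent amplifies; in the standard deepdream objective the layer's gradient is simply set to its own activations (L2 maximization). A sketch of that core step, following the published deepdream notebook code:

```python
import numpy as np

def objective_L2(dst):
    """Standard deepdream objective: boost whatever the chosen layer
    already responds to by using its activations as its own gradient."""
    dst.diff[:] = dst.data

# one gradient-ascent step, with `net` a caffe.Classifier and
# end = "inception_4c/output" (the layer passed via -l):
#   src = net.blobs["data"]
#   net.forward(end=end)
#   objective_L2(net.blobs[end])
#   net.backward(start=end)
#   g = src.diff[0]
#   src.data[:] += step_size / np.abs(g).mean() * g
```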

Once enough frames are processed (the script will cut the audio to the needed length automatically) or once all frames are done, put the frames + audio back together:

./3_frames2movie.sh [frames_directory] [original_video_with_sound]
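The reassembly script again boils down to an ffmpeg call that encodes the processed frames and muxes in the audio track from the original clip. A hedged sketch (the codec flags are one sensible choice, not necessarily what 3_frames2movie.sh does):

```python
import subprocess

def frames_to_movie(frames_dir, original_movie, out_file="dreamed.mp4", fps=25):
    """Encode numbered frames back to video and copy the audio from the
    original movie, stopping at the shorter of the two streams."""
    subprocess.check_call([
        "ffmpeg",
        "-r", str(fps),
        "-i", frames_dir + "/%08d.jpg",   # processed frames
        "-i", original_movie,             # source of the audio track
        "-map", "0:v", "-map", "1:a",     # video from frames, audio from original
        "-c:v", "libx264", "-pix_fmt", "yuv420p",
        "-shortest",                      # cut audio to the number of frames done
        out_file,
    ])

frames_to_movie("processed", "movie.mp4")
```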

## More information

This repo implements a deep neural network hallucinating Fear & Loathing in Las Vegas. By visualizing the internals of a deep net, we let it develop further what it thinks it sees.

We're using the #deepdream technique developed by Google, first explained in the Google Research blog post about Neural Network art.


Parameters used (and useful to play with):

  • network: standard reference GoogLeNet model trained on ImageNet from the Caffe Model Zoo (https://github.com/BVLC/caffe/wiki/Model-Zoo)

  • iterations: 5

  • jitter: 32 (default)

  • octaves: 4 (default)

  • layers locked to moving upwards from inception_4c/output to inception_5b/output and back again (only the output layers, as they are the most sensitive for visualizing "objects", whereas the reduce layers act more like "edge detectors")

  • every next unprocessed frame in the movie clip is blended with the previous processed frame before being "dreamed" on, with the alpha moving from 0.5 to 1 and back again (so from 50% previously dreamed image and 50% movie frame, up to taking 100% of the movie frame only). This takes care of "overfitting" on individual frames and makes sure we don't iteratively build up more and more hallucinations of the net and drift away from the original movie clip; a minimal sketch of this schedule follows after this list.
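To make the two schedules above concrete, here is a minimal sketch of how the blend factor can oscillate between 0.5 and 1.0 and how the end layer can walk up from inception_4c/output to inception_5b/output and back; the exact period and layer list used in 2_dreaming_time.py may differ:

```python
import itertools

# GoogLeNet output layers, walked upwards and back down again
LAYERS = ["inception_4c/output", "inception_4d/output", "inception_4e/output",
          "inception_5a/output", "inception_5b/output"]
layer_cycle = itertools.cycle(LAYERS + LAYERS[-2:0:-1])   # 4c..5b..4d, repeating

def blend_for_frame(i, period=100):
    """Triangle wave between 0.5 (half previous dreamed frame mixed in)
    and 1.0 (only the raw movie frame)."""
    phase = (i % period) / float(period)
    tri = 2 * phase if phase < 0.5 else 2 * (1 - phase)
    return 0.5 + 0.5 * tri

for i in range(5):
    print(i, next(layer_cycle), round(blend_for_frame(i), 3))
```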

An investigation of using the MIT Places trained CNN (mainly landscapes):

https://www.youtube.com/watch?v=6IgbMiEaFRY

Installing DeepDream:

## Credits

Roelof | KTH & Graph Technologies | @graphific