/videogrep

Primary LanguagePythonMIT LicenseMIT

Videogrep

Videogrep searches through dialog in video files (using .srt subtitle tracks) and makes supercuts based on what it finds.

##Requirements Clone this repository, and then install pattern and moviepy along with the other requirements.

pip install -r requirements.txt

Install ffmpeg with Ogg/Vorbis support. If you're on a mac with homebrew you can install ffmpeg with:

brew install ffmpeg --with-libvpx --with-libvorbis

You may need to change the path to your installation of ffmpeg by modifying the first line of videogrep.py:

FFMPEG_BINARY = '/usr/local/bin/ffmpeg'

##How to use it The most basic use:

python videogrep.py --input path/to/video_or_folder --search 'search phrase'

You can put any regular expression in the search phrase.

You can also search for part-of-speech tags using Pattern. See the Pattern-Search documentation for some details about how this works, and the Penn Treebank tag set for a list of usuable part-of-speech tags. For example the following will search for every line of dialog that contains an adjective (JJ) followed by a singular noun (NN):

python videogrep.py --input path/to/video_or_folder --search 'JJ NN' --search-type pos

You can also do a hypernym search - which essentially searches for words that fit into a specific category. The following, for example, will search for any line of dialog that references a liquid (like water, coffee, beer, etc.):

python videogrep.py --input path/to/video_or_folder --search 'liquid' --search-type hyper

NOTE: videogrep requires the subtitle track and the video file to have the exact same name, up to the extension. For example, my_movie.mp4 and my_movie.srt will work, my_movie.mp4 and my_movie_subtitle.srt will not work.

###Options

videogrep can take a number of options:

####--input / -i Video or subtitle file, or folder containing multiple files

####--output / -o Name of the file to generate. By default this is "supercut.mp4"

####--search / -s Search term

####--search-type / -st Type of search you want to perform. There are three options:

  • re: regular expression (this is the default).
  • pos: part of speech search (uses pattern.search). For example 'JJ NN' would return all lines of dialog that contain an adjective followed by a noun.
  • hyper: hypernym search. For example 'body parts' grabs all lines of dialog that reference a body part

####--max-clips / -m Maximum number of clips to use for the supercut

####--test / -t Show the search results without making the supercut

####--randomize / -r Randomize the order of the clips

####--padding / -p Padding in milliseconds to add to the start and end of each clip

##Samples