Python implementation of Bag-of-Words encoding of local features like SIFT
Recently, I have been doing some research on video computing, in particular on segmenting videos by Bag-of-Words (BoW) encoding of local features such as SIFT. I ran some experiments and decided to take notes on them. I have added some extra notes after the key lines of code.
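To make the idea concrete before going into details, here is a minimal sketch of the encoding step only. It is not the code from my experiments. It assumes a codebook (visual vocabulary) already exists, for example from clustering training descriptors, and it uses random 128-dimensional vectors as stand-ins for real SIFT descriptors. The function name `bow_encode` and the vocabulary size of 8 are made up for illustration.

```python
import numpy as np

def bow_encode(descriptors, codebook):
    """Encode a set of local descriptors as a normalized histogram of visual words.

    descriptors: (n, d) array, e.g. SIFT descriptors of one frame (d = 128).
    codebook:    (k, d) array of visual words (assumed to be given).
    """
    # Squared Euclidean distance between every descriptor and every codeword.
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    # Hard assignment: each descriptor votes for its nearest codeword.
    words = d2.argmin(axis=1)
    # Count the votes per codeword, then L1-normalize the histogram.
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    total = hist.sum()
    return hist / total if total > 0 else hist

# Random stand-ins; real descriptors would come from a SIFT extractor.
rng = np.random.default_rng(0)
codebook = rng.random((8, 128))      # hypothetical 8-word vocabulary
descriptors = rng.random((50, 128))  # 50 fake descriptors for one frame
hist = bow_encode(descriptors, codebook)
```

Each frame then becomes one fixed-length vector (`hist`) regardless of how many keypoints it contains, which is what makes frames comparable for segmentation.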