A playground for using machine learning techniques to analyze code.

Document embedding
  Bag-of-words model
  TF-IDF
Tokenizers
  BPE tokenizer
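
As a rough illustration of the bag-of-words and TF-IDF topics named above, the sketch below embeds two small code snippets as sparse term-weight dictionaries. It is not taken from this repository: the regex tokenizer, the function names (`tokenize`, `bag_of_words`, `tf_idf`), the sample snippets, and the unsmoothed `log(N / df)` weighting are all assumptions chosen to keep the example short.

```python
import math
import re
from collections import Counter


def tokenize(source):
    # Assumed tokenizer: identifiers, integer literals, and single
    # punctuation characters. A real code tokenizer would be richer.
    return re.findall(r"[A-Za-z_]\w*|\d+|[^\s\w]", source)


def bag_of_words(tokens):
    # Bag-of-words: token order is discarded, only counts are kept.
    return Counter(tokens)


def tf_idf(documents):
    # One bag per document.
    bags = [bag_of_words(tokenize(doc)) for doc in documents]
    n_docs = len(bags)

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for bag in bags:
        doc_freq.update(bag.keys())

    vectors = []
    for bag in bags:
        total = sum(bag.values())
        # Term frequency times unsmoothed inverse document frequency.
        vectors.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in bag.items()
        })
    return vectors


# Hypothetical sample inputs.
snippets = [
    "def add(a, b): return a + b",
    "def sub(a, b): return a - b",
]
vectors = tf_idf(snippets)
```

With this weighting, tokens shared by every snippet (such as `def` or `return`) get a weight of zero, while tokens unique to one snippet (such as `add` or `sub`) get a positive weight. Library implementations often apply smoothing so that shared terms keep a small non-zero weight.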