Vietnamese Speech to Text: Dataset and Methodology