/goppt

Go reader for the legacy Microsoft PowerPoint .ppt binary files

Primary LanguageGoApache License 2.0Apache-2.0

goppt Go Report Card Go Reference

Native Go text extractor from the legacy MS PPT (Microsoft PowerPoint) binary files.

Example

f, err := os.Open("testdata/simplepres.ppt")
if err != nil {
  handleErr(err)
}
text, err := goppt.ExtractText(f)
if err != nil {
  handleErr(err)
}
fmt.Println(text)

Special Thanks

A lot of thanks to https://github.com/richardlehane/mscfb and its author Richard Lehane. It helped a lot with parsing Microsoft old binary format.

Also I am grateful to Alex Rembish with PHP text extraction implementation.