/MediaStation

Extract assets from Media Station, Inc. CD-ROM titles (including Disney's Animated Storybook)

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Disney's Animated Storybook: The Lion King Media Station

A Python-based asset extractor and very incomplete bytecode decompiler for Media Station, Inc. CD-ROM children's titles. I loved many of these when I was growing up - especially Dalmatians. The ultimate goal of this repo is to create an engine for ScummVM, but I've found investigation and reverse-engineering easier to do out-of-tree in Python. Please join me in preserving these top-quality children's titles for future generations!

The most important knowledge needed right now is documentation for the bytecode engine (scripts). If you have knowledge or code you can contribute, please let us know!

Installation

Get it on PyPI: pip3 install MediaStation

Usage

Installing the PyPI package also installs a MediaStation script that can be invoked from the command line as follows:

#             Input directory   Export directory
#            ................. ...................
MediaStation ~/DALMATIANS/DATA ~/DalmatiansExtract

Known Titles

See and contribute to the updated list on this projects's wiki.

Support My Reverse Engineering!

Buy Me A Coffee

If you like my reverse engineering, you can buy me a matcha latte to show your appreciation 🍵!

Technical Details

Please contribute if you have more details here!

File Formats

All the data files for known titles are stored in the data/ subdirectory on the CD-ROM. These seem to be the same across the Windows and Mac versions. Some titles have additional files than these (like Tonka Garage, which has some Direct3D models for the car design activity), but these are the known files and formats unique to the Media Station engine.

Media Station titles have these types of files:

  • Context files (*.CXT)
  • Title definition file (BOOT.STM)
  • Profile (PROFILE._ST) - ONLY non-OG titles.

Each context file generally contains all the assets (and, depending on the version, the scripts) necessary to render one screen of the game. Since the format seems to have been originally designed for Disney's Interactive Storybook, this makes sense.

Context Files

  • A context file contains one or more subfiles, which are each complete and (almost) standard RIFFs.
  • Each subfile inside a context file contains one or more (almost) standard RIFF _chunk_s.
    • igod: Indicates a chunk that contains metadata about asset(s) in metadata sections.
    • a000, where 000 is a string that represents a 3-digit hexadecimal number: Indicates a chunk that contains actual asset data (mainly sounds and bitmaps) with lower-level metadata in metadata sections.
  • Each chunk can contain the following:
    • One or more metadata sections.
    • Raw asset data (PCM audio or RLE-compressed bitmaps).
  • Each metadata section contains one or more _datum_s.
  • Each datum contains a "primitive" data type (integer, string, etc.)

Title Definition (System) File

Also known as the "system" file. Contains metadata sections with global title information like the following:

  • Title compiler version.
  • Declarations of each context file.
  • File offsets of all subfiles in all context files.
  • Declarations of cursors stored as resources in the executable.

Profile

When present, contains a human-readable enumeration of metadata like the following:

  • All the assets in the title, along with the IDs and chunk FourCC(s) for that asset.
  • Declarations of the variables, constants, cursors, and so forth used in the game.

This doesn't seem to be opened/read by the executables at all while the titles are running. But there is a ton of useful cross-checking info in here.

Motivation

I re-discovered these titles when I was finding Director titles for the ScummVM Director engine at GSoC 2020. Coincidentally, the main data file extension (*.CXT) used in Media Station titles is the same as that used for protected Director cast archives. I quickly discovered these weren't Director titles but something completely different - and so this project was born to preserve them.

Engine History

Coming soon! For now, the Disney's Animated Storybook article has great background on the early titles, sourced largely from Newton Lee's books.

Future Enhancements

  • The bytecode decompiler needs a ton of work.
  • Some script data seems to be stored in the executables. That should be extracted.
  • Write a wikipedia article about the defunct Michigan-based company Media Station, Inc.

References