Visual-iMessage-Demo Prototype of what Siri could look like if it could describe image messages. GPT-4V is used for visual analysis of the images, and Rime is used for the voice synthesis.