Kimi-VL-A3B-Thinking is a multi-modal LLM that can understand text, video images, and generate text with thinking processes. this specific space was hacked to also accept videos, and the system prompt has been changed to favor video analysis.