The model delivers dramatically enhanced performance, with a breakthrough in long-context understanding across modalities.