This is an uncensored version based on Qwen/Qwen3-VL-32B-Instruct, created using Heretic v1.0.1. This model is a powerful vision-language model with advanced visual understanding, text understanding, and multimodal reasoning capabilities, supporting various tasks such as image analysis, video understanding, and interface operation.
Multimodal
Transformers