Model instances

Pixtral

A 12 billion parameter open-source model developed by Mistral, marking the company's first foray into multimodal capabilities. Pixtral is designed to understand both images and text, released with open weights under the Apache 2.0 license.

As an instruction-tuned model, Pixtral is pre-trained on a large-scale dataset of interleaved image and text documents. Therefore, it is capable of multi-turn, multi-image conversations. Unlike previous open-source models, Pixtral maintains excellent text benchmark performance while excelling in multimodal tasks.

Key features:

Points to be cautious about:

To deploy Pixtral 12B, you can run openllm serve pixtral:12b with OpenLLM.

Reading

Articles


Tags: ai   model   vision  

Last modified 22 March 2026