With 12 billion parameters, this advanced model is designed for tasks like image captioning and object counting. Pixtral 12B, built on Mistral’s earlier text model Nemo 12B, can handle an arbitrary number of images at arbitrary sizes, supplied either as URLs or as base64-encoded data, making it versatile for a wide range of applications in fields requiring visual data analysis, such as content moderation, healthcare, and more.
Although there are no working web demos yet, Mistral has plans to enable testing soon. Pixtral 12B will be accessible via the company's chatbot platform, Le Chat, and its API-serving platform, La Plateforme, in the near future. While not yet publicly tested, Pixtral 12B’s release marks an important step in Mistral’s efforts to compete in the growing field of multimodal AI models.
Mistral has made Pixtral 12B freely available under the permissive Apache 2.0 licence, allowing users to download, fine-tune, and deploy the model, including for commercial use. The model can be accessed through a torrent link on GitHub or via the AI development platform Hugging Face, making it easily available to developers and researchers.
It’s part of a growing trend of multimodal models, similar to OpenAI’s GPT-4 and Anthropic’s Claude family, that integrate both visual and textual understanding. At roughly 24GB in size, Pixtral 12B’s 12-billion-parameter architecture suggests strong problem-solving capabilities: as a rough rule, models with higher parameter counts tend to perform better, though training data and architecture matter just as much.