huggingface AI

2 picks we've reviewed.

MiniMax M3

MiniMax's third-generation model — M3 is an image-text-to-text multimodal model built to process visual and text inputs together, advancing from MiniMax's earlier text-only releases.

MiniMax

open-source · Apache 2.0 · 128K tokens (E2B/E4B)3w ago

model multimodal

Gemma 4

Google DeepMind's fourth-generation open-weight model family — five sizes from 2B to 31B, Apache 2.0 licensed, with the 12B Unified variant accepting text, image, audio, and video in a single encoder-free architecture.

Google DeepMind