huggingface AI
2 picks we've reviewed.
paidJust dropped
model multimodal
MiniMax M3
MiniMax's third-generation model — M3 is an image-text-to-text multimodal model built to process visual and text inputs together, advancing from MiniMax's earlier text-only releases.
MiniMax
open-source · Apache 2.0 · 128K tokens (E2B/E4B)3w ago
model multimodal
Gemma 4
Google DeepMind's fourth-generation open-weight model family — five sizes from 2B to 31B, Apache 2.0 licensed, with the 12B Unified variant accepting text, image, audio, and video in a single encoder-free architecture.
Google DeepMind