gen‑ai.news
← Back
Multimodal

MiniMax M3 launches on NVIDIA platform with Free Endpoint

MiniMax M3 has arrived on NVIDIA's accelerated compute platform, bringing with it a free API endpoint that lowers the barrier to entry for developers and researchers wanting to experiment with the model. The availability through NVIDIA's infrastructure means users can access M3 without immediately committing to usage costs, which is a meaningful step for broader adoption.

The model is multimodal, handling text, image, and video inputs within a single architecture. This breadth of modality support puts M3 in a growing category of models designed to work across content types rather than specializing in just one - a design direction that reflects where much of the industry has been heading as practical use cases increasingly involve mixed media.

One of the more technically notable aspects of M3 is its use of sparse attention. Standard attention mechanisms scale poorly with sequence length, becoming computationally expensive as context grows. Sparse attention addresses this by selectively attending to subsets of tokens rather than every possible pair, which allows the model to handle longer inputs - documents, extended video sequences, or multi-turn interactions - without a proportional explosion in compute cost.

Running on NVIDIA accelerated compute ties the model to hardware that is already widely used in production AI workloads, which should ease integration for teams already operating within that ecosystem. MiniMax, a Shanghai-based AI company, has been building out its model portfolio steadily, and making M3 accessible through a major platform like NVIDIA's represents a clear effort to reach a wider developer audience outside of its primary markets.

Enjoy this story? Get the next one in your inbox.

Twice a week: the most important stories in generative image and video AI, distilled into a 2-minute read.

Free. Unsubscribe any time. No spam, ever.

Your next read

Multimodal

Powering the world’s first AI arts museum

Rafik Anadol Studio has opened Dataland, billed as the world's first museum dedicated to AI arts, with Google Cloud providing the underlying infrastructure and Google Arts & Culture lending institutional support. The museum marks a notable step in bringing generative AI art into a dedicated physical and cultural space. It represents one of the more concrete attempts to treat AI-generated art as a serious curatorial discipline.

Multimodal

Photoshop and Premiere now have AI assistants

Adobe has launched a public beta bringing AI assistants to Photoshop, Premiere, Illustrator, InDesign, and Frame.io. Each assistant is tailored to its host application, handling tasks specific to that tool rather than acting as a general-purpose chatbot. The rollout marks a significant step in Adobe's broader push to embed conversational AI throughout its Creative Cloud suite.

Multimodal

Adobe adds AI agents to Photoshop, Premiere, and more Creative Cloud apps

Adobe is integrating AI agents into its core Creative Cloud applications, including Photoshop and Premiere, allowing users to describe a desired outcome in plain language while the software carries out the underlying multi-step tasks. The rollout also extends to third-party platforms such as ChatGPT and Claude. The move reflects a broader shift in how professional creative tools are beginning to handle complex, multi-action workflows.