Nvidia Released a Powerful Open-Source AI Model Named ‘NVLM 1.0’

IBL News | New York

This week, Nvidia released a powerful open-source AI model that competes with proprietary systems from companies like OpenAI and Google.

In a paper, Nvidia researchers explained that the company’s new NVLM 1.0 family of large multimodal language models, led by the 72-billion-parameter NVLM-D-72B, “demonstrates exceptional performance across vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models.”

By making the model weights publicly available and promising to release the training code, Nvidia is breaking from the trend of keeping advanced AI systems closed.

The initiative grants researchers and developers unprecedented access to cutting-edge technology.

Benchmark results comparing NVIDIA’s NVLM-D model with models from AI giants, such as GPT-4, Claude 3.5, and Llama 3-V, showing NVLM-D’s competitive performance across various visual and language tasks. (Credit: arxiv.org)

 

NVIDIA’s new AI model analyzes a meme comparing academic abstracts to full papers, demonstrating its ability to interpret visual humor and scholarly concepts. (Credit: arxiv.org)

Nvidia’s decision to make such a powerful model openly available could accelerate AI research and development across the field.

Experts said the move could spark a chain reaction, as other well-funded companies may feel pressure to open their research, potentially accelerating AI progress across the board.

Nvidia’s decision also raises questions about the future of AI business models. If state-of-the-art models become freely available, companies may need to rethink how they create value and maintain a competitive edge in AI.

 

[Disclosure: ibl.ai, the parent company of iblnews.org, has NVIDIA as a client]