Vision Language Model

Fastest AI Vision Model for Your Laptop: Liquid AI LFM 2.5

Liquid AI’s LFM 2.5 runs a vision-language model locally in your browser via WebGPU and ONNX Runtime, working offline once ...

Ai2 releases MolmoWeb, an open-weight visual web agent with 30K human task trajectories and a full training stack

Ai2's MolmoWeb is the first open-weight visual web agent to ship with its full training dataset, giving enterprise teams the ...

DeepRoute.ai Presents 40B Vision-Language-Action Foundation Model at NVIDIA GTC 2026, Accelerating Autonomous Driving at Scale

At NVIDIA GTC 2026, DeepRoute.ai presented a comprehensive introduction to its 40-billion-parameter Vision-Language-Action (VLA) Foundation Model architecture, representing a fundamental breakthrough ...

Geeky Gadgets

Top AI Vision-Language Models: What You Need to Know

Imagine a world where your devices not only see but truly understand what they’re looking at—whether it’s reading a document, tracking where someone’s gaze lands, or answering questions about a video.

Nvidia expands open AI model portfolio and enlists partners for frontier development

Touting its status as the “world’s largest contributor to open-source AI,” Nvidia Corp. is doubling down on open artificial ...

SiliconANGLE

Hugging Face open-sources world’s smallest vision language model

Hugging Face Inc. today open-sourced SmolVLM-256M, a new vision language model with the lowest parameter count in its category. The algorithm’s small footprint allows it to run on devices such as ...

VentureBeat

DeepSeek unleashes 'Janus Pro 7B' vision model amidst AI stock bloodbath, igniting fresh fears of Chinese tech dominance

DeepSeek, the fast-growing Chinese AI company, is shaking up global technology yet again. Just as the rapid rise of the company's frontier AI models triggered a selloff of U.S. artificial intelligence ...

TechCrunch

Cohere claims its new Aya Vision AI model is best-in-class

Cohere For AI, AI startup Cohere’s nonprofit research lab, this week released a multimodal “open” AI model, Aya Vision, that the lab claimed is best-in-class. Aya Vision can perform tasks like writing ...

Forbes

How Vision Language Models Will Shape The Future Of Self-Driving Cars

As I highlighted in my last article, two decades after the DARPA Grand Challenge, the autonomous vehicle (AV) industry is still waiting for breakthroughs—particularly in addressing the “long tail ...

Security

Milestone Systems Launches Traffic-Focused Vision Language Model

Milestone Systems has released an advanced vision language model (VLM) specializing in traffic understanding, powered by NVIDIA Cosmos Reason, a framework designed to enable advanced reasoning across ...
