NVIDIA’s Nemotron 3 Nano Omni wants to be the eyes and ears of agents
NVIDIA’s new open multimodal model is pitched as a cheaper perception layer for agents that need to read screens, documents, video, and audio without stitching four models together.