Tag: model weights
-
Hacker News: ARIA: An Open Multimodal Native Mixture-of-Experts Model
Source URL: https://arxiv.org/abs/2410.05993 Source: Hacker News Title: ARIA: An Open Multimodal Native Mixture-of-Experts Model Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the introduction of “Aria,” an open multimodal native mixture-of-experts AI model designed for various tasks including language understanding and coding. As an open-source project, it offers significant advantages for…
-
Hacker News: Nvidia releases NVLM 1.0 72B open weight model
Source URL: https://huggingface.co/nvidia/NVLM-D-72B Source: Hacker News Title: Nvidia releases NVLM 1.0 72B open weight model Feedly Summary: Comments AI Summary and Description: Yes Summary: The text introduces NVLM 1.0, a new family of advanced multimodal large language models (LLMs) developed with a focus on vision-language tasks. It demonstrates state-of-the-art performance comparable to leading proprietary and…