All discussions filtered by tag "multimodal"

LLM Developments: Key Learnings 2024

Key insights into LLM advancements in 2024, including model performance, efficiency, pricing, and multimodal capabilities.

Meta Launches Spirit LM Model

Meta's Spirit LM is an open-source, multimodal AI model integrating text and speech for enhanced communication.

Liquid AI Launches Non-Transformer Models

MIT spinoff Liquid launches efficient non-transformer AI models, outperforming traditional transformers while optimizing for memory use.

Open Source AI Model Revolutionizes Agents

Open source AI model Molmo enhances AI agents, enabling better task execution on computers with visual and conversational capabilities.