Artificial-intelligence

  • Published on
    10 min, 0 Comments
    This post explores the X-MAS framework, which investigates the benefits of using diverse Large Language Models (LLMs) within multi-agent systems (MAS). It details X-MAS-Bench, a comprehensive testbed that evaluates 27 LLMs across 5 domains and 5 MAS functions and finds that no single LLM excels universally. Building on these findings, the paper demonstrates significant performance improvements (up to 47-63% on challenging math problems) when moving from homogeneous to heterogeneous MAS configurations, highlighting the potential of drawing on the collective intelligence of diverse LLMs.
  • Published on
    12 min, 0 Comments
    Explore the history and technical approaches in the quest for Artificial General Intelligence, from early symbolic AI and expert systems to deep learning and probabilistic programming, illustrated by real-world applications like nuclear monitoring.
  • Published on
    5 min, 0 Comments
    Explore the latest developments in Manus AI, China's groundbreaking autonomous AI agent that promises to revolutionize tasks from web development to financial analysis. Discover its capabilities, limitations, and how it stacks up against competitors like OpenAI's Deep Research.
  • Published on
    5 min, 0 Comments
    DeepSeek is accelerating the release of its R2 model, promising groundbreaking advancements in AI reasoning, coding, and multilingual capabilities. With a focus on cost efficiency and open-source innovation, R2 could challenge Western AI giants like OpenAI and Anthropic.