site:www.marktechpost.com

Ant Group Releases Ling 2.0: A Reasoning-First MoE Language Model Series Built on the ...

Every Ling 2.0 model uses the same sparse Mixture of Experts layer. Each layer has 256 routed experts and one shared expert. The router picks 8 routed experts for every token, the shared expert is ...

marktechpost

Microsoft Releases Agent Lightning: A New AI Framework that Enables Reinforcement Learning ...

How do you convert real agent traces into reinforcement learning RL transitions to improve policy LLMs without changing your existing agent stack? Microsoft AI team releases Agent Lightning to help ...

marktechpost

MiniMax Releases MiniMax M2: A Mini Open Model Built for Max Coding and Agentic Workflows ...

Can an open source MoE truly power agentic coding workflows at a fraction of flagship model costs while sustaining long-horizon tool use across MCP, shell, browser, retrieval, and code? MiniMax team ...

marktechpost

IBM AI Team Releases Granite 4.0 Nano Series: Compact and Open-Source Small Models Built ...

What is new in Granite 4.0 Nano series? Granite 4.0 Nano consists of four model lines and their base counterparts. Granite 4.0 H 1B uses a hybrid SSM based architecture and is about 1.5B parameters.

marktechpost

Zhipu AI Releases ‘Glyph’: An AI Framework for Scaling the Context Length through ...

Can we render long texts as images and use a VLM to achieve 3–4× token compression, preserving accuracy while scaling a 128K context toward 1M-token workloads? A team of researchers from Zhipu AI ...

marktechpost

A New AI Research from Anthropic and Thinking Machines Lab Stress Tests Model Specs and ...

AI companies use model specifications to define target behaviors during training and evaluation. Do current specs state the intended behaviors with enough precision, and do frontier models exhibit ...

marktechpost

Salesforce AI Research Introduces WALT (Web Agents that Learn Tools): Enabling LLM agents ...

Web agents often fail when layouts shift or when tasks require long sequences. WALT targets this failure mode by mining site functionality offline, then exposing it as tools that encapsulate ...

marktechpost

Google vs OpenAI vs Anthropic: The Agentic AI Arms Race Breakdown

In this article we will analyze how Google, OpenAI, and Anthropic are productizing ‘agentic’ capabilities across computer-use control, tool/function calling, orchestration, governance, and enterprise ...

marktechpost

Model Context Protocol (MCP) vs Function Calling vs OpenAPI Tools — When to Use Each?

Orchestration Host routes across many servers/tools App-local chaining Agent/toolkit routes intents → operations ...

marktechpost

UltraCUA: A Foundation Computer-Use Agents Model that Bridges the Gap between General ...

Computer-use agents have been limited to primitives. They click, they type, they scroll. Long action chains amplify grounding errors and waste steps. Apple Researchers introduce UltraCUA, a foundation ...

marktechpost

Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding ...

The post emphasizes coding parity with Sonnet 4 and computer-use gains relative to Sonnet 4 under these scaffolds. Users should replicate with their own orchestration, tool stacks, and thinking ...

marktechpost

Alibaba’s Qwen AI Releases Compact Dense Qwen3-VL 4B/8B (Instruct & Thinking) With FP8 ...

What’s in the release? SKUs and variants: The new additions comprise four dense models— Qwen3-VL-4B and Qwen3-VL-8B, each in Instruct and Thinking editions—alongside FP8 versions of the 4B/8B Instruct ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果