site:www.marktechpost.com

Ant Group Releases Ling 2.0: A Reasoning-First MoE Language Model Series Built on the ...

Every Ling 2.0 model uses the same sparse Mixture of Experts layer. Each layer has 256 routed experts and one shared expert. The router picks 8 routed experts for every token, the shared expert is ...

marktechpost

Microsoft Releases Agent Lightning: A New AI Framework that Enables Reinforcement Learning ...

How do you convert real agent traces into reinforcement learning RL transitions to improve policy LLMs without changing your existing agent stack? Microsoft AI team releases Agent Lightning to help ...

marktechpost

IBM AI Team Releases Granite 4.0 Nano Series: Compact and Open-Source Small Models Built ...

What is new in Granite 4.0 Nano series? Granite 4.0 Nano consists of four model lines and their base counterparts. Granite 4.0 H 1B uses a hybrid SSM based architecture and is about 1.5B parameters.

marktechpost

MiniMax Releases MiniMax M2: A Mini Open Model Built for Max Coding and Agentic Workflows ...

Can an open source MoE truly power agentic coding workflows at a fraction of flagship model costs while sustaining long-horizon tool use across MCP, shell, browser, retrieval, and code? MiniMax team ...

marktechpost

Zhipu AI Releases ‘Glyph’: An AI Framework for Scaling the Context Length through ...

Can we render long texts as images and use a VLM to achieve 3–4× token compression, preserving accuracy while scaling a 128K context toward 1M-token workloads? A team of researchers from Zhipu AI ...

marktechpost

A New AI Research from Anthropic and Thinking Machines Lab Stress Tests Model Specs and ...

AI companies use model specifications to define target behaviors during training and evaluation. Do current specs state the intended behaviors with enough precision, and do frontier models exhibit ...

marktechpost

Salesforce AI Research Introduces WALT (Web Agents that Learn Tools): Enabling LLM agents ...

Web agents often fail when layouts shift or when tasks require long sequences. WALT targets this failure mode by mining site functionality offline, then exposing it as tools that encapsulate ...

marktechpost

Context Engineering

Anthropic recently released a guide on effective Context Engineering for AI Agents — a reminder that context is a critical yet limited resource. The... In this tutorial, we explore how to build a ...

marktechpost

Google Proposes TUMIX: Multi-Agent Test-Time Scaling With Tool-Use Mixture

TUMIX runs a group of heterogeneous agents—text-only Chain-of-Thought, code-executing, web-searching, and guided variants—in parallel, then iterates a small number of refinement rounds where each ...

marktechpost

MLPerf Inference v5.1 (2025): Results Explained for GPUs, CPUs, and AI Accelerators

decoding MLPerf Inference v5.1 2025 results, scenarios, TTFT/TPOT, power metrics for GPUs, CPUs, accelerators, datacenter, edge ...

marktechpost

Sakana AI Released ShinkaEvolve: An Open-Source Framework that Evolves Programs for ...

An open-source framework that couples LLM-driven program mutations with evolutionary search to automate algorithm discovery and optimization. Code and report are public. 2) How does it achieve higher ...

marktechpost

Perplexity Launches an AI Email Assistant Agent for Gmail and Outlook, Aimed at Scheduling ...

Perplexity introduced “Email Assistant,” an AI agent that plugs into Gmail and Outlook to draft replies in your voice, auto-label and prioritize messages, and coordinate meetings end-to-end ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果