Microsoft said performance improved on the DRACO benchmark, which measures research accuracy, completeness, and objectivity across 100 tasks. Researcher with Critique recorded a 7-point increase in ...