We conducted a two-phase evaluation. First, we assessed LLMs (GPT o4-mini and Gemini 2.5 Pro) on 1,000 synthetic clinical hematology/oncology vignettes with ...
Compiler testing and bug detection are critical research areas that ensure the reliability and correctness of software tools fundamental to modern computing. Contemporary compilers, which convert ...