We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems
A multimodal agent that can interact with its own PC in a multimodal manner.