AI groups rush to redesign model testing and create new benchmarks
AI企业正在加紧重新设计模型测试并创建新的基准

Rapidly advancing technology is surpassing current methods of evaluating and comparing large language models
快速发展的技术正在超越当前评估和比较大型语言模型的方法。

AI groups rush to redesign model testing and create new benchmarksAI企业正在加紧重新设计模型测试并创建新的基准