FT商学院

AI groups rush to redesign model testing and create new benchmarks
AI企业正在加紧重新设计模型测试并创建新的基准

Rapidly advancing technology is surpassing current methods of evaluating and comparing large language models
快速发展的技术正在超越当前评估和比较大型语言模型的方法。