Meta faced criticism for using an experimental version of its Llama 4 Maverick AI model to achieve a high score on LM Arena. The unmodified Maverick performed poorly compared to older models, highlighting the unreliability of benchmarking.
Read MoreDid you find this insightful?
Bad
Just Okay
Amazing