Tag: fairness

  • Slashdot: Study Accuses LM Arena of Helping Top AI Labs Game Its Benchmark

    Source URL: https://slashdot.org/story/25/05/01/0525208/study-accuses-lm-arena-of-helping-top-ai-labs-game-its-benchmark?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: Study Accuses LM Arena of Helping Top AI Labs Game Its Benchmark
    Feedly Summary:
    AI Summary and Description: Yes
    Summary: The report raises significant concerns about transparency and fairness in AI benchmarking, focusing on allegations of biased practices at LM Arena. Such revelations could impact the trustworthiness…

  • Slashdot: Meta Says Llama 4 Targets Left-Leaning Bias

    Source URL: https://tech.slashdot.org/story/25/04/10/1628209/meta-says-llama-4-targets-left-leaning-bias?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: Meta Says Llama 4 Targets Left-Leaning Bias
    Feedly Summary:
    AI Summary and Description: Yes
    Summary: Meta’s announcement of the Llama 4 AI model focuses on addressing political bias, particularly “left-leaning” tendencies. This marks a significant shift in the discourse surrounding AI bias, which previously centered on race, gender, and nationality.
    Detailed Description:…

  • The Register: Meta accused of Llama 4 bait-and-switch to juice AI benchmark rank

    Source URL: https://www.theregister.com/2025/04/08/meta_llama4_cheating/
    Source: The Register
    Title: Meta accused of Llama 4 bait-and-switch to juice AI benchmark rank
    Feedly Summary: Did Facebook giant rizz up LLM to win over human voters? It appears so. Meta submitted a specially crafted, non-public variant of its Llama 4 AI model to an online benchmark that may have unfairly…

  • Simon Willison’s Weblog: Quoting lmarena.ai

    Source URL: https://simonwillison.net/2025/Apr/8/lmaren/#atom-everything
    Source: Simon Willison’s Weblog
    Title: Quoting lmarena.ai
    Feedly Summary: We’ve seen questions from the community about the latest release of Llama-4 on Arena. To ensure full transparency, we’re releasing 2,000+ head-to-head battle results for public review. […] In addition, we’re also adding the HF version of Llama-4-Maverick to Arena, with leaderboard results…