Tag: world scenarios
-
Hacker News: Evals are not all you need
Source URL: https://www.marble.onl/posts/evals_are_not_all_you_need.html Source: Hacker News Title: Evals are not all you need Feedly Summary: Comments AI Summary and Description: Yes Summary: The text critiques the use of evaluations (evals) for assessing AI systems, particularly large language models (LLMs), arguing that they are inadequate for guaranteeing performance or reliability. It highlights various limitations of evals,…