competitive programming – Experimental News Clipping Site

Slashdot: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Jun 17, 2025

Source URL: https://slashdot.org/story/25/06/17/149238/how-do-olympiad-medalists-judge-llms-in-competitive-programming?utm_source=rss1.0mainlinkanon&utm_medium=feed
Source: Slashdot
Title: How Do Olympiad Medalists Judge LLMs in Competitive Programming?
Feedly Summary:
AI Summary and Description: Yes
Summary: The text discusses a newly established benchmark demonstrating that large language models (LLMs) are not yet capable of outperforming elite human coders, particularly in problem-solving contexts. The findings indicate limitations in the…

Hacker News: To avoid being replaced by LLMs, do what they can’t

Feb 15, 2025, by Kurt Seifried in Uncategorized

Source URL: https://www.seangoedecke.com/what-llms-cant-do/
Source: Hacker News
Title: To avoid being replaced by LLMs, do what they can’t
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: The text discusses the implications of advanced large language models (LLMs) on the field of software engineering, outlining strategies for engineers to adapt in light of the impending shift…

Hacker News: Notes on OpenAI O3-Mini

Feb 1, 2025, by Kurt Seifried in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/31/o3-mini/
Source: Hacker News
Title: Notes on OpenAI O3-Mini
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: The announcement of OpenAI’s o3-mini model marks a significant development in the landscape of large language models (LLMs). With enhanced performance on specific benchmarks and user functionalities that include internet search capabilities, o3-mini aims to…

Simon Willison’s Weblog: OpenAI o3-mini, now available in LLM

Jan 31, 2025, by Kurt Seifried in Uncategorized

Source URL: https://simonwillison.net/2025/Jan/31/o3-mini/#atom-everything
Source: Simon Willison’s Weblog
Title: OpenAI o3-mini, now available in LLM
Feedly Summary: o3-mini is out today. As with other o-series models it’s a slightly difficult one to evaluate – we now need to decide if a prompt is best run using GPT-4o, o1, o3-mini or (if we have access) o1 Pro.…

Hacker News: Performance of LLMs on Advent of Code 2024

Dec 30, 2024, by system automation in Uncategorized

Source URL: https://www.jerpint.io/blog/advent-of-code-llms/
Source: Hacker News
Title: Performance of LLMs on Advent of Code 2024
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: The text discusses an experiment evaluating the performance of Large Language Models (LLMs) during the Advent of Code 2024 challenge, revealing that LLMs did not perform as well as expected. The…

Hacker News: Konwinski Prize

Dec 16, 2024, by system automation in Uncategorized

Source URL: https://andykonwinski.com/2024/12/12/konwinski-prize.html
Source: Hacker News
Title: Konwinski Prize
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: The text introduces the K Prize, a $1 million competition aimed at enhancing open source AI development through a benchmarking initiative called SWE-bench, which focuses on coding performance without the risk of cheating. It underscores the importance…
