Tag: stochastic gradient descent

  • Hacker News: An attempt at AGI on the Tokio Runtime

    Source URL: https://www.christo.sh/building-agi-on-the-tokio-runtime/ Source: Hacker News Title: An attempt at AGI on the Tokio Runtime Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text outlines an individual’s experimental journey to build Artificial General Intelligence (AGI) through a biologically inspired neural network running on the Tokio Runtime. The project involves a unique approach to…

  • Hacker News: No More Adam: Learning Rate Scaling at Initialization Is All You Need

    Source URL: https://arxiv.org/abs/2412.11768 Source: Hacker News Title: No More Adam: Learning Rate Scaling at Initialization Is All You Need Feedly Summary: Comments AI Summary and Description: Yes Summary: The text presents a novel optimization technique called SGD-SaI that enhances the stochastic gradient descent (SGD) algorithm for training deep neural networks. This method simplifies the process…