Tag: Cerebras

  • Simon Willison’s Weblog: December in LLMs has been a lot

    Source URL: https://simonwillison.net/2024/Dec/20/december-in-llms-has-been-a-lot/#atom-everything Source: Simon Willison’s Weblog Title: December in LLMs has been a lot Feedly Summary: I had big plans for December: for one thing, I was hoping to get to an actual RC of Datasette 1.0, in preparation for a full release in January. Instead, I’ve found myself distracted by a constant barrage…

  • The Register: Cheat codes for LLM performance: An introduction to speculative decoding

    Source URL: https://www.theregister.com/2024/12/15/speculative_decoding/ Source: The Register Title: Cheat codes for LLM performance: An introduction to speculative decoding Feedly Summary: Sometimes two models really are faster than one Hands on When it comes to AI inferencing, the faster you can generate a response, the better – and over the past few weeks, we’ve seen a number…

  • The Register: Biden administration bars China from buying HBM chips critical for AI accelerators

    Source URL: https://www.theregister.com/2024/12/03/biden_hbm_china_export_ban/ Source: The Register Title: Biden administration bars China from buying HBM chips critical for AI accelerators Feedly Summary: 140 Middle Kingdom firms added to US trade blacklist The Biden administration has announced restrictions limiting the export of memory critical to the production of AI accelerators and banning sales to more than a…

  • Hacker News: Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference

    Source URL: https://cerebras.ai/blog/llama-405b-inference/ Source: Hacker News Title: Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses breakthrough advancements in AI inference speed, specifically highlighting Cerebras’s Llama 3.1 405B model, which showcases significantly superior performance metrics compared to traditional GPU solutions. This…

  • Simon Willison’s Weblog: Cerebras Coder

    Source URL: https://simonwillison.net/2024/Oct/31/cerebras-coder/#atom-everything Source: Simon Willison’s Weblog Title: Cerebras Coder Feedly Summary: Cerebras Coder Val Town founder Steve Krouse has been building demos on top of the Cerebras API that runs Llama3.1-70b at 2,000 tokens/second. Having a capable LLM with that kind of performance turns out to be really interesting. Cerebras Coder is a demo…

  • Hacker News: Cerebras Trains Llama Models to Leap over GPUs

    Source URL: https://www.nextplatform.com/2024/10/25/cerebras-trains-llama-models-to-leap-over-gpus/ Source: Hacker News Title: Cerebras Trains Llama Models to Leap over GPUs Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses Cerebras Systems’ advancements in AI inference performance, particularly highlighting its WSE-3 hardware and its ability to outperform Nvidia’s GPUs. With a reported performance increase of 4.7X and significant…

  • Simon Willison’s Weblog: llm-cerebras

    Source URL: https://simonwillison.net/2024/Oct/25/llm-cerebras/ Source: Simon Willison’s Weblog Title: llm-cerebras Feedly Summary: llm-cerebras Cerebras (previously) provides Llama LLMs hosted on custom hardware at ferociously high speeds. GitHub user irthomasthomas built an LLM plugin that works against their API – which is currently free, albeit with a rate limit of 30 requests per minute for their two…

  • Hacker News: Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s

    Source URL: https://cerebras.ai/blog/cerebras-inference-3x-faster/ Source: Hacker News Title: Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s Feedly Summary: Comments AI Summary and Description: Yes Summary: The text announces a significant performance upgrade to Cerebras Inference, showcasing its ability to run the Llama 3.1-70B AI model at an impressive speed of 2,100 tokens per second. This…

  • Slashdot: AI Chipmaker Cerebras Files For IPO To Take On Nvidia

    Source URL: https://slashdot.org/story/24/10/01/0030246/ai-chipmaker-cerebras-files-for-ipo-to-take-on-nvidia?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: AI Chipmaker Cerebras Files For IPO To Take On Nvidia Feedly Summary: AI Summary and Description: Yes Summary: Cerebras Systems, an AI chip startup, is preparing to go public with an IPO on Nasdaq, intending to compete in the AI chip market against industry giants like Nvidia. Their WSE-3…

  • Hacker News: AI chipmaker Cerebras files for IPO to take on Nvidia

    Source URL: https://www.cnbc.com/2024/09/30/cerebras-files-for-ipo.html Source: Hacker News Title: AI chipmaker Cerebras files for IPO to take on Nvidia Feedly Summary: Comments AI Summary and Description: Yes Summary: Cerebras Systems has filed for an initial public offering (IPO), aiming to provide competition to established players like Nvidia in the AI chip market. The company’s WSE-3 chip offers…