Tag: production
-
The Register: Supermicro crams 18 GPUs into a 3U AI server that’s a little slow by design
Source URL: https://www.theregister.com/2024/10/09/supermicro_sys_322gb_nr_18_gpu_server/
Source: The Register
Title: Supermicro crams 18 GPUs into a 3U AI server that’s a little slow by design
Feedly Summary: Can handle edge inferencing or run a 64-display command center. GPU-enhanced servers can typically pack up to eight of the accelerators, but Supermicro has built a box that manages to…
-
Irrational Exuberance: Modeling impact of LLMs on Developer Experience.
Source URL: https://lethain.com/dx-llm-model/
Source: Irrational Exuberance
Title: Modeling impact of LLMs on Developer Experience.
Feedly Summary: In "How should you adopt Large Language Models (LLMs)?", we considered how LLMs might impact a company’s developer experience. To support that exploration, I’ve developed a system model of developing software at the company. In this chapter, we’ll…
-
Cloud Blog: Moving from experimentation into production with Gemini and Vertex AI
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/experimentation-to-production-with-gemini-and-vertex-ai/
Source: Cloud Blog
Title: Moving from experimentation into production with Gemini and Vertex AI
Feedly Summary: You might have seen the recent stat on how 61% of enterprises are running generative AI use cases in production, and with industry leaders including PUMA, Snap, and Warner Brothers Discovery speaking at today’s Gemini…