fine-tuning – Page 2 – Experimental News Clipping Site

Cloud Blog: vLLM Performance Tuning: The Ultimate Guide to xPU Inference Configuration

Aug 25, 2025

—

by

Source URL: https://cloud.google.com/blog/topics/developers-practitioners/vllm-performance-tuning-the-ultimate-guide-to-xpu-inference-configuration/ Source: Cloud Blog Title: vLLM Performance Tuning: The Ultimate Guide to xPU Inference Configuration Feedly Summary: Additional contributors include Hossein Sarshar, Ashish Narasimham, and Chenyang Li. Large Language Models (LLMs) are revolutionizing how we interact with technology, but serving these powerful models efficiently can be a challenge. vLLM has rapidly become…

Cloud Blog: 101+ gen AI use cases with technical blueprints

Aug 21, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/real-world-gen-ai-use-cases-with-technical-blueprints/ Source: Cloud Blog Title: 101+ gen AI use cases with technical blueprints Feedly Summary: A little over a year ago, we published a list of generative AI use cases that has since grown to include more than 600 examples of how organizations are putting AI to work. Yet for many developers and…

Cloud Blog: How startups can help build — and benefit from — the AI revolution

Aug 21, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/industry-leaders-on-whats-next-for-startups-and-ai/ Source: Cloud Blog Title: How startups can help build — and benefit from — the AI revolution Feedly Summary: Startups are at the forefront of generative AI development, pushing current capabilities and unlocking new potential. Building on our Future of AI: Perspectives for Startups 2025 report, several of the AI industry leaders…

Simon Willison’s Weblog: Introducing Gemma 3 270M: The compact model for hyper-efficient AI

Aug 14, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/14/gemma-3-270m/#atom-everything Source: Simon Willison’s Weblog Title: Introducing Gemma 3 270M: The compact model for hyper-efficient AI Feedly Summary: New from Google: Gemma 3 270M, a compact, 270-million parameter model designed from the ground up for task-specific fine-tuning with strong instruction-following and text structuring…

Cloud Blog: Google is a Leader in the 2025 Gartner® Magic Quadrant™ for Container Management

Aug 12, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/containers-kubernetes/2025-gartner-magic-quadrant-for-container-management-leader/ Source: Cloud Blog Title: Google is a Leader in the 2025 Gartner® Magic Quadrant™ for Container Management Feedly Summary: We’re excited to share that Gartner has recognized Google as a Leader for the third year in a row in the 2025 Gartner® Magic Quadrant™ for Container Management, based on its Completeness of…

Cisco Security Blog: Foundation-sec-8B-Instruct: An Out-of-the-Box Security Copilot

Aug 6, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://feedpress.me/link/23535/17112350/foundation-sec-8b-instruct-out-of-the-box-security-copilot Source: Cisco Security Blog Title: Foundation-sec-8B-Instruct: An Out-of-the-Box Security Copilot Feedly Summary: Foundation-sec-8B-Instruct layers instruction fine-tuning on top of our domain-focused base model, giving you a chat-native copilot that understands security. AI Summary and Description: Yes Summary: The text describes a new method of fine-tuning AI models, specifically designed for security applications. This…

OpenAI: Estimating worst case frontier risks of open weight LLMs

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://openai.com/index/estimating-worst-case-frontier-risks-of-open-weight-llms Source: OpenAI Title: Estimating worst case frontier risks of open weight LLMs Feedly Summary: In this paper, we study the worst-case frontier risks of releasing gpt-oss. We introduce malicious fine-tuning (MFT), where we attempt to elicit maximum capabilities by fine-tuning gpt-oss to be as capable as possible in two domains: biology and…

Cloud Blog: Announcements for AI Hypercomputer: The latest infrastructure news for ML practitioners

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/q2-2025-ai-hypercomputer-updates/ Source: Cloud Blog Title: Announcements for AI Hypercomputer: The latest infrastructure news for ML practitioners Feedly Summary: Curious about the latest in AI infrastructure from Google Cloud? Every three months we share a roundup of the latest AI Hypercomputer news, resources, events, learning opportunities, and more. Read on to learn new ways…

Cloud Blog: Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs

Jul 28, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/compute/dynamic-workload-scheduler-calendar-mode-reserves-gpus-and-tpus/ Source: Cloud Blog Title: Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs Feedly Summary: Organizations need ML compute resources that can accommodate bursty peaks and periodic troughs. That means the consumption models for AI infrastructure need to evolve to be more cost-efficient, provide term flexibility, and support rapid…

Cloud Blog: Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI

Jul 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/take-an-open-model-from-discovery-to-endpoint-on-vertex-ai/ Source: Cloud Blog Title: Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI Feedly Summary: Developers building with gen AI are increasingly drawn to open models for their power and flexibility. But customizing and deploying them can be a huge challenge. You’re often left wrestling…

Tag: fine-tuning