Tag: fine-tuning

  • Cloud Blog: Etsy’s Service Platform on Cloud Run cuts deployment time from days to under an hour

    Source URL: https://cloud.google.com/blog/products/infrastructure/etsys-service-platform-on-cloud-run-cuts-deployment-time-from-days-to-minutes/ Source: Cloud Blog Title: Etsy’s Service Platform on Cloud Run cuts deployment time from days to under an hour Feedly Summary: Introduction Etsy, a leading ecommerce marketplace for handmade, vintage, and unique items has a passion for delivering innovative and seamless experiences for customers. Like many fast growing companies, Etsy needed to…

  • Cloud Blog: Generative AI with enterprise controls for business users in 24 Hours

    Source URL: https://cloud.google.com/blog/topics/partners/gen-ai-with-enterprise-controls-for-business-users-in-24-hours/ Source: Cloud Blog Title: Generative AI with enterprise controls for business users in 24 Hours Feedly Summary: Aible is a leader in generating business impact from AI in less than 30 days, helping teams use AI to extract enterprise value from raw enterprise data with solutions for customer acquisition, churn prevention, demand…

  • Hacker News: LoRA vs. Full Fine-Tuning: An Illusion of Equivalence

    Source URL: https://arxiv.org/abs/2410.21228 Source: Hacker News Title: LoRA vs. Full Fine-Tuning: An Illusion of Equivalence Feedly Summary: Comments AI Summary and Description: Yes Summary: The paper presents a comparative study of Low-Rank Adaptation (LoRA) and full fine-tuning for large language models (LLMs). It reveals significant differences in how each method alters pre-trained models, particularly focusing…

  • Schneier on Security: Subverting LLM Coders

    Source URL: https://www.schneier.com/blog/archives/2024/11/subverting-llm-coders.html Source: Schneier on Security Title: Subverting LLM Coders Feedly Summary: Really interesting research: “An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection“: Abstract: Large Language Models (LLMs) have transformed code com- pletion tasks, providing context-based suggestions to boost developer productivity in software engineering. As users often…

  • Hacker News: Tencent drops a 389B MoE model(Open-source and free for commercial use))

    Source URL: https://github.com/Tencent/Tencent-Hunyuan-Large Source: Hacker News Title: Tencent drops a 389B MoE model(Open-source and free for commercial use)) Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text introduces the Hunyuan-Large model, the largest open-source Transformer-based Mixture of Experts (MoE) model, developed by Tencent, which boasts 389 billion parameters, optimizing performance while managing resource…

  • Simon Willison’s Weblog: Nous Hermes 3

    Source URL: https://simonwillison.net/2024/Nov/4/nous-hermes-3/#atom-everything Source: Simon Willison’s Weblog Title: Nous Hermes 3 Feedly Summary: Nous Hermes 3 The Nous Hermes family of fine-tuned models have a solid reputation. Their most recent release came out in August, based on Meta’s Llama 3.1: Our training data aggressively encourages the model to follow the system and instruction prompts exactly…

  • AWS News Blog: AWS Weekly Roundup: AWS Lambda, Amazon Bedrock, Amazon Redshift, Amazon CloudWatch, and more (Nov 4, 2024)

    Source URL: https://aws.amazon.com/blogs/aws/aws-weekly-roundup-aws-lambda-amazon-bedrock-amazon-redshift-amazon-cloudwatch-and-more-nov-4-2024/ Source: AWS News Blog Title: AWS Weekly Roundup: AWS Lambda, Amazon Bedrock, Amazon Redshift, Amazon CloudWatch, and more (Nov 4, 2024) Feedly Summary: The spooky season has come and gone now. While there aren’t any Halloween-themed releases, AWS has celebrated it in big style by having a plethora of exciting releases last…

  • Cloud Blog: Flipping out: Modernizing a classic pinball machine with cloud connectivity

    Source URL: https://cloud.google.com/blog/products/application-modernization/connecting-a-pinball-machine-to-the-cloud/ Source: Cloud Blog Title: Flipping out: Modernizing a classic pinball machine with cloud connectivity Feedly Summary: In today’s cloud-centric world, we often take for granted the ease with which we can integrate our applications with a vast array of powerful cloud services. However, there are still countless legacy systems and other constrained…

  • Cloud Blog: A practical guide to synthetic data generation with Gretel and BigQuery DataFrames

    Source URL: https://cloud.google.com/blog/products/data-analytics/synthetic-data-generation-with-gretel-and-bigquery-dataframes/ Source: Cloud Blog Title: A practical guide to synthetic data generation with Gretel and BigQuery DataFrames Feedly Summary: In our previous post, we explored how integrating Gretel with BigQuery DataFrames streamlines synthetic data generation while preserving data privacy. To recap, BigQuery DataFrames is a Python client for BigQuery, providing pandas-compatible APIs with…

  • Hacker News: AMD Open-Source 1B OLMo Language Models

    Source URL: https://www.amd.com/en/developer/resources/technical-articles/introducing-the-first-amd-1b-language-model.html Source: Hacker News Title: AMD Open-Source 1B OLMo Language Models Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses AMD’s development and release of the OLMo series, a set of open-source large language models (LLMs) designed to cater to specific organizational needs through customizable training and architecture adjustments. This…