Tag: fine-tuning

Source URL: https://cloud.google.com/blog/topics/partners/gen-ai-with-enterprise-controls-for-business-users-in-24-hours/ Source: Cloud Blog Title: Generative AI with enterprise controls for business users in 24 Hours Feedly Summary: Aible is a leader in generating business impact from AI in less than 30 days, helping teams use AI to extract enterprise value from raw enterprise data with solutions for customer acquisition, churn prevention, demand…

Hacker News: LoRA vs. Full Fine-Tuning: An Illusion of Equivalence

Nov 8, 2024

Source URL: https://arxiv.org/abs/2410.21228 Source: Hacker News Title: LoRA vs. Full Fine-Tuning: An Illusion of Equivalence Summary: The paper presents a comparative study of Low-Rank Adaptation (LoRA) and full fine-tuning for large language models (LLMs). It reveals significant differences in how each method alters pre-trained models, particularly focusing…

Schneier on Security: Subverting LLM Coders

Nov 7, 2024

Source URL: https://www.schneier.com/blog/archives/2024/11/subverting-llm-coders.html Source: Schneier on Security Title: Subverting LLM Coders Feedly Summary: Really interesting research: “An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection”: Abstract: Large Language Models (LLMs) have transformed code completion tasks, providing context-based suggestions to boost developer productivity in software engineering. As users often…

Hacker News: Tencent drops a 389B MoE model(Open-source and free for commercial use))

Nov 5, 2024

Source URL: https://github.com/Tencent/Tencent-Hunyuan-Large Source: Hacker News Title: Tencent drops a 389B MoE model(Open-source and free for commercial use)) Summary: The text introduces the Hunyuan-Large model, the largest open-source Transformer-based Mixture of Experts (MoE) model, developed by Tencent, which boasts 389 billion parameters, optimizing performance while managing resource…

Simon Willison’s Weblog: Nous Hermes 3

Source URL: https://simonwillison.net/2024/Nov/4/nous-hermes-3/#atom-everything Source: Simon Willison’s Weblog Title: Nous Hermes 3 Feedly Summary: Nous Hermes 3 The Nous Hermes family of fine-tuned models have a solid reputation. Their most recent release came out in August, based on Meta’s Llama 3.1: Our training data aggressively encourages the model to follow the system and instruction prompts exactly…

AWS News Blog: AWS Weekly Roundup: AWS Lambda, Amazon Bedrock, Amazon Redshift, Amazon CloudWatch, and more (Nov 4, 2024)

Source URL: https://aws.amazon.com/blogs/aws/aws-weekly-roundup-aws-lambda-amazon-bedrock-amazon-redshift-amazon-cloudwatch-and-more-nov-4-2024/ Source: AWS News Blog Title: AWS Weekly Roundup: AWS Lambda, Amazon Bedrock, Amazon Redshift, Amazon CloudWatch, and more (Nov 4, 2024) Feedly Summary: The spooky season has come and gone now. While there aren’t any Halloween-themed releases, AWS has celebrated it in big style by having a plethora of exciting releases last…

Cloud Blog: Flipping out: Modernizing a classic pinball machine with cloud connectivity

Source URL: https://cloud.google.com/blog/products/application-modernization/connecting-a-pinball-machine-to-the-cloud/ Source: Cloud Blog Title: Flipping out: Modernizing a classic pinball machine with cloud connectivity Feedly Summary: In today’s cloud-centric world, we often take for granted the ease with which we can integrate our applications with a vast array of powerful cloud services. However, there are still countless legacy systems and other constrained…

Cloud Blog: A practical guide to synthetic data generation with Gretel and BigQuery DataFrames

Source URL: https://cloud.google.com/blog/products/data-analytics/synthetic-data-generation-with-gretel-and-bigquery-dataframes/ Source: Cloud Blog Title: A practical guide to synthetic data generation with Gretel and BigQuery DataFrames Feedly Summary: In our previous post, we explored how integrating Gretel with BigQuery DataFrames streamlines synthetic data generation while preserving data privacy. To recap, BigQuery DataFrames is a Python client for BigQuery, providing pandas-compatible APIs with…

Hacker News: AMD Open-Source 1B OLMo Language Models

Nov 1, 2024
