data licensing – Experimental News Clipping Site

Slashdot: Reddit Wants ‘Deeper Integration’ with Google in Exchange for Licensed AI Training Data

Sep 22, 2025

—

by

Source URL: https://tech.slashdot.org/story/25/09/22/0313234/reddit-wants-deeper-integration-with-google-in-exchange-for-licensed-ai-training-data?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Reddit Wants ‘Deeper Integration’ with Google in Exchange for Licensed AI Training Data Feedly Summary: AI Summary and Description: Yes Summary: The text discusses Reddit’s ongoing negotiations with Google for a new deal that involves deeper integration with AI products and a dynamic pricing structure for licensing its data.…

Slashdot: RSS Co-Creator Launches New Protocol For AI Data Licensing

Sep 11, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://tech.slashdot.org/story/25/09/10/2320207/rss-co-creator-launches-new-protocol-for-ai-data-licensing?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: RSS Co-Creator Launches New Protocol For AI Data Licensing Feedly Summary: AI Summary and Description: Yes Summary: The Real Simple Licensing (RSL) initiative seeks to standardize and simplify the licensing of online content for AI training, backed by major publishers such as Reddit and Medium. It aims to create…

The Register: Coordinates of millions of smartphones feared stolen, sparking yet another lawsuit against data broker

Feb 6, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/02/06/gravy_analytics_data_breach_suit/ Source: The Register Title: Coordinates of millions of smartphones feared stolen, sparking yet another lawsuit against data broker Feedly Summary: Fourth time’s the harm? Gravy Analytics has been sued yet again for allegedly failing to safeguard its vast stores of personal data, which are now feared stolen. And by personal data we…

Simon Willison’s Weblog: Releasing the largest multilingual open pretraining dataset

Nov 14, 2024

—

by

system automation

in Uncategorized

Source URL: https://simonwillison.net/2024/Nov/14/releasing-the-largest-multilingual-open-pretraining-dataset/#atom-everything Source: Simon Willison’s Weblog Title: Releasing the largest multilingual open pretraining dataset Feedly Summary: Releasing the largest multilingual open pretraining dataset Common Corpus is a new “open and permissible licensed text dataset, comprising over 2 trillion tokens (2,003,039,184,047 tokens)" released by French AI Lab PleIAs. This appears to be the largest available…

Wired: This Startup Wants YouTube Creators to Get Paid for AI Training Data

Sep 30, 2024

—

by

system automation

in Uncategorized

Source URL: https://www.wired.com/story/license-to-scrape-youtube-ai-data-license-creators/ Source: Wired Title: This Startup Wants YouTube Creators to Get Paid for AI Training Data Feedly Summary: While big platforms like Reddit have signed deals with the AI giants, YouTube leaves licensing in the hands of individual creators. The “License to Scrape” program aims to give those streaming stars proper leverage. AI…

Tag: data licensing

Slashdot: Reddit Wants ‘Deeper Integration’ with Google in Exchange for Licensed AI Training Data

Slashdot: RSS Co-Creator Launches New Protocol For AI Data Licensing

The Register: Coordinates of millions of smartphones feared stolen, sparking yet another lawsuit against data broker

Simon Willison’s Weblog: Releasing the largest multilingual open pretraining dataset

Wired: This Startup Wants YouTube Creators to Get Paid for AI Training Data