Tag: data sourcing

  • The Register: Perplexity AI decries News Corp’s ‘simply false’ data scraping claims

    Source URL: https://www.theregister.com/2024/10/25/perplexity_news_corp_data/ Source: The Register Title: Perplexity AI decries News Corp’s ‘simply false’ data scraping claims Feedly Summary: ‘They prefer to live in a world where publicly reported facts are owned by corporations’ Artificial intelligence startup Perplexity AI has hit back at a lawsuit claiming that it’s unfairly harvesting data from Dow Jones &…

  • Simon Willison’s Weblog: Quoting Jens Ohlig

    Source URL: https://simonwillison.net/2024/Oct/20/jens-ohlig/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Jens Ohlig Feedly Summary: Who called it “intellectual property problems around the acquisition of training data for Large Language Models” and not Grand Theft Autocomplete? — Jens Ohlig, on March 8th 2024 Tags: training-data, llms, ai, generative-ai AI Summary and Description: Yes Summary: The text highlights…

  • Hacker News: Ichigo: Local real-time voice AI

    Source URL: https://github.com/homebrewltd/ichigo Source: Hacker News Title: Ichigo: Local real-time voice AI Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the launch of the open research project 🍓 Ichigo, which enhances a text-based large language model (LLM) with native listening capabilities through improved audio processing techniques. It highlights advancements in the…

  • Wired: A New Group Is Trying to Make AI Data Licensing Ethical

    Source URL: https://www.wired.com/story/dataset-providers-alliance-ethical-generative-ai-licensing/ Source: Wired Title: A New Group Is Trying to Make AI Data Licensing Ethical Feedly Summary: The Dataset Providers Alliance calls for creators and rights holders to be able to opt in to having their material used for training purposes. AI Summary and Description: Yes Summary: The text discusses the evolving landscape…