Simon Willison’s Weblog: TextSynth Server

Source URL: https://simonwillison.net/2024/Nov/21/textsynth-server/
Source: Simon Willison’s Weblog
Title: TextSynth Server

Feedly Summary: TextSynth Server
I’d missed this: Fabrice Bellard (yes, that Fabrice Bellard) has a project called TextSynth Server which he describes like this:

ts_server is a web server proposing a REST API to large language models. They can be used for example for text completion, question answering, classification, chat, translation, image generation, …
It has the following characteristics:

All is included in a single binary. Very few external dependencies (Python is not needed) so installation is easy.
Supports many Transformer variants (GPT-J, GPT-NeoX, GPT-Neo, OPT, Fairseq GPT, M2M100, CodeGen, GPT2, T5, RWKV, LLAMA, Falcon, MPT, Llama 3.2, Mistral, Mixtral, Qwen2, Phi3, Whisper) and Stable Diffusion.
[…]

Unlike many of his other notable projects (such as FFmpeg, QEMU, QuickJS) this isn’t open source – in fact it’s not even source available, you instead can download compiled binaries for Linux or Windows that are available for non-commercial use only.
Commercial terms are available, or you can visit textsynth.com and pre-pay for API credits which can then be used with the hosted REST API there.
This is not a new project: the earliest evidence I could find of it was this July 2019 page in the Internet Archive, which said:

Text Synth is build using the GPT-2 language model released by OpenAI. […] This implementation is original because instead of using a GPU, it runs using only 4 cores of a Xeon E5-2640 v3 CPU at 2.60GHz. With a single user, it generates 40 words per second. It is programmed in plain C using the LibNC library.

Via They all use it – Thorsten Ball
Tags: llms, ai, generative-ai

AI Summary and Description: Yes

Summary: The TextSynth Server, developed by Fabrice Bellard, is a web server that provides a REST API for various large language models (LLMs). It emphasizes ease of installation, supporting multiple Transformer variants, but it is not open-source, which poses implications for its adoption in AI projects.

Detailed Description:

– **TextSynth Server Overview:**
– TextSynth Server is a web server that interfaces with large language models through a REST API.
– It offers functionalities such as text completion, question answering, classification, chat, translation, and image generation.

– **Key Characteristics:**
– **Single Binary:** All components are bundled into a single binary, reducing installation complexity.
– **Minimal Dependencies:** Unlike many similar solutions, it does not require Python, making it more accessible to deploy.
– **Support for Multiple Models:** The server supports a wide array of Transformer variants, which are essential for developers looking for flexibility in model choice. The models include:
– GPT-J, GPT-NeoX, GPT-Neo, and others.
– Stable Diffusion for image generation.

– **Commercial and Usage Restrictions:**
– The software is not open-source, and users can download compiled binaries only for non-commercial use.
– Commercial terms are outlined for users interested in broader applications beyond personal or educational use.
– Additionally, pre-paid API credits are available for accessing the hosted REST API.

– **Historical Context:**
– Although the project is not new, with early evidence dating back to July 2019, it highlights the continued evolution and demand around LLMs.
– Notably, early implementations leverage a CPU architecture rather than a GPU, showcasing efficiency in processing without high-end hardware.

**Implications for Security and Compliance Professionals:**
– The closed-source nature of TextSynth Server raises questions about security, auditing, and compliance, as professionals may need to consider risk factors associated with proprietary software.
– Understanding how it generates language could feed into larger discussions on responsible AI use, particularly around output management and governance.
– The support for numerous models may influence how organizations integrate and manage disparate AI technologies across cloud infrastructure.

This analysis underscores the significance of tooling like TextSynth Server in the landscape of AI, particularly for professionals managing the intersection of technology, security, and compliance.