Data for AI
Train your LLMs and MT engines with high-quality domain-specific multilingual corpora, carefully curated by TAUS data experts. Explore our offering below or get in touch to obtain the data you need.
Up to 97.3% Discount !
This spring, TAUS offers its entire collection of multilingual training data for sale at discounts of up to 97.3% of the original price. The end data of this data sale is extended from April 30 until June 30.
The data collection on offer contains close to 7.4 billion words in 483 language pairs. Fill in the form below to download the Data Catalog, Pricing & Terms and the History of TAUS Data.
Download Data Catalog & Pricing