Efficient AI · Sapient Intelligence
HRM-Text: A 1B Model Trained From Scratch for $1,500
HRM-Text trains a 1B language model from scratch on 40B tokens for about $1,500, scoring 60.7% MMLU, 84.5% GSM8K and 56.2% MATH by swapping Transformers for a hierarchical recurrent model.