Topics
StenToken Progress report: `english_code` is at `122.3 GB / 150 GB`, and the main phase is now close to the finish line. Last updated 2026-07-02 Tokenizers · Current Runs · Research · New Ideas
StenToken-FineWeb Archived FineWeb-only tokenizer direction. Last updated 2026-07-02 Tokenizers · Research · New Ideas · Archived
StenToken-FineWeb-Edu Planned StenToken variant trained on 100% FineWeb-Edu for dataset-specific compression. Last updated 2026-07-01 Tokenizers · Research · New Ideas
Stentor4-80M Progress report: the current Stentor4 build is about 80% to 90% through debugging. Last updated 2026-07-02 Models · Current Runs · Bugs · Research · New Ideas
Stentor4-Preview Preview track for Stentor4 using the StenToken schedule so followers can see the model sooner instead of waiting for a separate 32K-tokenizer path. Last updated 2026-06-30 Models · Current Runs · Research · New Ideas
SLM ARENA COMING SOON!!! The arena is up and running, with a couple of minor bugs here and there. Last updated 2026-07-02 Models · Research · New Ideas · Archived