Scroll
A model by Apex

Training data, on demand.

Describe a task. Romi builds the dataset — diverse, structured, ready to fine-tune.

The model

Meet Romi.

One prompt in, a complete training set out. No scraping, no manual labeling, no busywork.

Built for scale

Up to 50,000 diverse, de-duplicated examples per dataset, streamed live as Romi generates them.

Frontier intelligence

Backed by a state-of-the-art model with a vast context window for nuanced, high-quality examples.

Train-ready formats

Export to JSONL, Alpaca, ShareGPT, OpenAI messages, or CSV — one click, no reformatting.

How it works

Three steps to a dataset.

01

Describe

Tell Romi the task in plain English — a domain, a tone, a label scheme.

02

Generate

Romi streams varied, high-quality examples in real time.

03

Export

Pick a format and download, ready for your fine-tuning pipeline.

About Apex

We build for the people training AI.

Every great model starts with great data — and gathering it is the part nobody enjoys. Apex exists to remove that friction. Romi is our first model: a focused engine that generates the training data your projects depend on, fast and on your terms. Spin up a dataset in the generator, compare plans and limits, or see how it works.

Romi outputs the formats teams already fine-tune with, like Stanford Alpaca and the Hugging Face training stack.

Made to train.

50,000examples per dataset
5export formats
1Mtoken context