Welcome
Bespoke Curator makes it easy to create high-quality synthetic data at scale, which you can use to fine-tune models or for structured data extraction.
Bespoke Curator is an open-source project that includes:
A rich Python-based library for generating and curating synthetic data.
A Curator Viewer that makes it easy to inspect datasets, which helps during dataset creation.
We will also be releasing high-quality datasets that should move the needle on post-training.
Start here.
Bespoke-MiniCheck is powered by the Bespoke-MiniCheck-7B model, a best-in-class lightweight model for detecting hallucinations.
It tops the LLM-AggreFact leaderboard.
You can use Bespoke-MiniCheck either via:
API service (easiest), or