Bespoke Labs
  • Welcome
  • BESPOKE CURATOR
    • Getting Started
      • Quick Tour
      • Key Concepts
      • Visualize your dataset with the Bespoke Curator Viewer
      • Automatic recovery and caching
      • Structured Output
    • Save $$$ on LLM inference
      • Using OpenAI for batch inference
      • Using Anthropic for batch inference
      • Using Gemini for batch inference
      • Using Mistral for batch inference
      • Using kluster.ai for batch inference
    • How-to Guides
      • Using vLLM with Curator
      • Using Ollama with Curator
      • Using LiteLLM with curator
      • Handling Multimodal Data in Curator
      • Executing LLM-generated code
      • Using HuggingFace inference providers with Curator
    • Data Curation Recipes
      • Generating a diverse QA dataset
      • Using SimpleStrat block for generating diverse data
      • Curate Reasoning data with Claude-3.7 Sonnet
      • Synthetic Data for function calling
    • Finetuning Examples
      • Aspect based sentiment analysis
      • Finetuning a model to identify features of a product
    • API Reference
  • Models
    • Bespoke MiniCheck
      • Self-Hosting
      • Integrations
      • API Service
    • Bespoke MiniChart
    • OpenThinker
Powered by GitBook
On this page
  1. BESPOKE CURATOR
  2. Getting Started

Visualize your dataset with the Bespoke Curator Viewer

PreviousKey ConceptsNextAutomatic recovery and caching

Last updated 1 month ago

IMPORTANT: We recently are retiring our local curator-viewer , now switching to a hosted Bespoke Curator Viewer.

The hosted Bespoke Curator Viewer is a rich interface to visualize data, and makes visually inspecting the data much easier.

Example:

Before your Curator run, set the CURATOR_VIEWER environment variable.

Bash:

export CURATOR_VIEWER=1

Python/colab:

import os
os.environ["CURATOR_VIEWER"]="1"

With this enabled, as curator generates data, it gets uploaded and you can see the responses streaming in the viewer. The URL for the viewer is displayed right next to the rich progress.

Then, run on any Bespoke Curator scripts, here we use examples/poem-generation/poem.py as an example, and

The line

Curator Viewer: ✨ Open Curator Viewer ✨ 

contains a clickable link (the following is an example viewer link, the link changes for each new Curator generation run) which opens the hosted Bespoke Curator Viewer in a new browser tab:

Authenticate with a Bespoke Labs API key

By default, datasets are accessible to anyone with the link. To keep your datasets private, you can associate them with a Bespoke Labs account. Doing so also allows you to:

  • Track all datasets associated with your account

  • Share datasets with collaborators

  • Analyze data generation costs over time

You can enable authentication as follows:

  1. Set the BESPOKE_API_KEY and CURATOR_VIEWER environment variables:

export BESPOKE_API_KEY=<YOUR_API_KEY>
export CURATOR_VIEWER=1

With these variables set, all your datasets will be streamed to the hosted viewer and linked to your Bespoke Labs account.

for a Bespoke Labs account.

Create an API key from the page.

You can visit the page to see all the datasets generated with your API keys or shared with you by others.

You can also visit the page to see the data generation costs for a given period.

https://curator.bespokelabs.ai/datasets/845c7dd33b8b4242a24ff048b5f94354
Sign up
API Key
Datasets
Cost Report
Example Bespoke Curator Rich CLI logs on synthetic poems
Example Bespoke Curator Viewer on synthetic poem topics
link