6 docs tagged with "sdk"

Datasets

Test datasets are the core of a data-driven development process. You can create your own dataset and define it in granular detail for evaluation.

Experiments (Evaluations)

Evaluations are used to run asynchronous metric computations over datasets. You can create a new evaluation, list all evaluations, and get the status of an evaluation.

Metrics

Metrics let you compute scores over a single data point or a small batch of data. They are best suited to runtime monitoring and online evaluation.

Projects

Projects are used to organize and manage your work. All datasets and evaluations are associated with a project.

Prompt Optimization

Prompt optimization is the process of iteratively refining a model's prompt to improve its performance on a specific task.

Synthetic Dataset Generation

Test datasets are the core of a data-driven development process. You can create your own dataset and define it in granular detail for evaluation.