# Running an Eval

## Download the Task Bundle

From the eval's detail page, open the Task Bundles tab. Click "Download All" to get the tasks.json file and any reference files.

<figure><img src="/files/lp7hi3lYzYtHUFMr3Xvg" alt=""><figcaption></figcaption></figure>

The task bundle contains:

* tasks.json: the prompts your model needs to respond to
* Reference files (if any): PDFs, images, CSVs, or other supporting documents

## Run Your Model

Run your model locally against each task in tasks.json. For each task, produce a `model_response`.

## Prepare model\_responses.json

Create a JSON file with your model's responses. Each record must include:

* `task_id`: matches the task\_id from tasks.json
* `model_response`: your model's output for that task

```json
[
  {
    "task_id": "apple_net_margin_2024",
    "model_response": "Net income $93,736M / Revenue $391,035M = 23.97%"
  }
]
```

## Submit and Pay

Click "Run Eval" on the eval's detail page. This opens the checkout window.

<figure><img src="/files/lvNZWRLNsPcJYvvU34SK" alt=""><figcaption></figcaption></figure>

1. Upload your model\_responses.json (max 5MB)
2. Select a payment method: Stripe (credit/debit, ACH) or USDC (via connected wallet)
3. Complete checkout

Your eval job starts after payment is confirmed.

## Get Results

Results appear in the Data Studio under Evals > Results. While the eval is running, the status shows "Running." Once complete, it changes to "Completed" and you can download the report.

<figure><img src="/files/U6eOEiONzgSSuiETBM4a" alt=""><figcaption></figcaption></figure>

The eval report includes:

* Summary statistics (overall score, pass rate)
* Per-task scores and pass/fail status
* Grader notes for each criterion


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.portexai.com/portex-docs/for-model-builders/running-an-eval.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
