> For the complete documentation index, see [llms.txt](https://docs.portexai.com/portex-docs/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.portexai.com/portex-docs/for-model-builders/browsing-evals.md).

# Browsing Evals

The Explore page at [datalab.portexai.com](https://datalab.portexai.com) is the main entry point for discovering evals and datasets.

## Explore Page

Use the top tabs to filter by content type: Featured, Evals, Datasets, RFPs, or All.

<figure><img src="/files/vu8jlZJnvYaElMueDhsI" alt=""><figcaption></figcaption></figure>

## Filters

The left sidebar provides filters to narrow results:

* SOC Occupations: filter evals by O\*NET-SOC occupational titles (e.g., Aerospace Engineers, Chemists, Clinical Psychologists)
* Eval Average Difficulty: score ranges from "mostly failures" (< 20%) to "strong" (> 80%)
* Creator Expertise: filter by education, institution, experience level
* Model Performance: filter by how specific models scored (e.g., claude-opus-4.6, gpt-5.2, grok-4)
* Max Runtime (test-time compute/inference): fast (< 1 min), medium (1-10 min), long (10+ min)
* Modality: text, image, code, tabular, audio, video, geospatial, and more

You can combine multiple filters. The card grid updates in real time.

<figure><img src="/files/McmDcuCPU6VNE3qaVWVp" alt=""><figcaption></figcaption></figure>

## Eval Detail Page

Click an eval card to open its detail page.

<figure><img src="/files/WA1uuKqpUWSbqeFz98iL" alt=""><figcaption></figcaption></figure>

The detail page has four tabs:

* Card: the eval description, background, and methodology
* Full Profile: expert info and credentials/socials
* Task Bundles: view and download tasks and reference files
* Leaderboards: model performance rankings with scores, run times, and relative cost
* License: the PSDLA license terms

<figure><img src="/files/lp7hi3lYzYtHUFMr3Xvg" alt=""><figcaption><p>Task Bundles tab</p></figcaption></figure>

The left sidebar shows the seller's profile, number of tasks, SOC occupation, modality, format, and file size.

The right panel shows pricing: the per-run price and (if available) the Core Dataset price and minimum bid.

### Task Viewer

Click "View Tasks" to open the task viewer modal, which shows the full rendered prompt (including LaTeX and math notation).

<figure><img src="/files/1xIkuLCOojGCkNOTF6RU" alt=""><figcaption></figcaption></figure>

### Leaderboards

The Leaderboards tab shows how frontier models perform on the eval, ranked by score with run time and relative cost. Portex administers leaderboards and maintains them as new SOTA models are released.

<figure><img src="/files/NJUr7u34Ed0NV8JG9ILe" alt=""><figcaption></figcaption></figure>


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://docs.portexai.com/portex-docs/for-model-builders/browsing-evals.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
