feat(evals): add page by benjamincanac · Pull Request #2182 · nuxt/nuxt.com

benjamincanac · 2026-02-12T17:57:11Z

🔗 Linked issue

❓ Type of change

📖 Documentation (updates to the documentation or readme)
🐞 Bug fix (a non-breaking change that fixes an issue)
👌 Enhancement (improving an existing functionality)
✨ New feature (a non-breaking change that adds functionality)
🧹 Chore (updates to the build process or auxiliary tools and libraries)
⚠️ Breaking change (fix or feature that would cause existing functionality to change)

📚 Description

vercel · 2026-02-12T17:57:16Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
nuxt	Ready	Preview, Comment	Feb 26, 2026 10:32am

atinux · 2026-02-12T21:43:22Z

Could you add a link from it in the footer?

benjamincanac · 2026-02-12T21:47:43Z

@atinux in which section?

atinux · 2026-02-12T22:21:03Z

Explore > AI Evals
And we move Showcase to Enterprise > Showcase

Another suggestion is to remove Modules in the footer as already in the header now and have:
Explore:

Templates
Showcase
AI Evals

coderabbitai · 2026-02-17T11:43:30Z

📝 Walkthrough

Walkthrough

Adds an AI Evals feature: footer navigation removes an external Modules link, updates Explore links to internal paths (/templates, /showcase), and adds an AI Evals item (/evals). Introduces a new page at app/pages/evals.vue that loads public agent-results data and renders an agent evaluation dashboard with filters, computed model/agent summary rows, expandable per-evaluation details, and SEO metadata. Adds a content collection evals in content.config.ts and content/evals.yml metadata, and adds public/agent-results.json with static evaluation results.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'feat(evals): add page' accurately summarizes the main change—adding a new evals page feature to the application.
Description check	✅ Passed	The PR description is related to the changeset as it specifies the type of change as a new feature, though it lacks detailed explanation of what the evals page does.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/evals

Tip

Try Coding Plans. Let us write the prompt for your AI agent so you can ship faster (with fewer bugs).
Share your feedback on Discord.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (4)

content.config.ts (1)
425-433: githubUrl should use .url() validation for consistency.

Other URL fields in this config (e.g., demo, purchase, website) use z.string().url(). Apply the same validation here.
♻️ Suggested fix
     evals: defineCollection({
       type: 'data',
       source: 'evals.yml',
       schema: z.object({
         title: z.string(),
         description: z.string(),
-        githubUrl: z.string()
+        githubUrl: z.string().url()
       })
     })
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@content.config.ts` around lines 425 - 433, The evals collection schema
currently validates githubUrl with z.string(); update the schema in the
defineCollection call for evals to use z.string().url() for githubUrl so it
matches other URL fields (demo, purchase, website) and enforces proper URL
validation; locate the evals defineCollection and replace the githubUrl schema
entry accordingly.
app/pages/evals.vue (2)
50-53: Fetching own static asset via full site URL is suboptimal for SSR.

$fetch(joinURL(url, '/agent-results.json')) makes an external HTTP request to the app's own origin during SSR. Since this is a file in public/, you can simplify to $fetch('/agent-results.json') — Nuxt's $fetch handles this internally during SSR without a network round-trip.
♻️ Suggested fix
 const [{ data: page }, { data: rawData }] = await Promise.all([
   useAsyncData('evals', () => queryCollection('evals').first()),
-  useAsyncData('agent-results', () => $fetch<AgentResultsData>(joinURL(url, '/agent-results.json')))
+  useAsyncData('agent-results', () => $fetch<AgentResultsData>('/agent-results.json'))
 ])
This also removes the need for the useSiteConfig() call on line 48 and the joinURL import on line 4 (if unused elsewhere).
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@app/pages/evals.vue` around lines 50 - 53, The $fetch call is making an
external request via joinURL(url, '/agent-results.json') during SSR; change the
useAsyncData invocation that fetches agent-results to call
$fetch('/agent-results.json') instead (e.g., inside the second useAsyncData for
'agent-results') so Nuxt serves the public file without a network round-trip,
and remove the now-unused useSiteConfig() call and joinURL import if they are no
longer referenced.
122-134: Model avatar mapping is brittle and will need manual updates for every new model.

This works for the current dataset but consider externalizing this mapping (e.g., into the JSON data or a config file) so new models don't require code changes.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@app/pages/evals.vue` around lines 122 - 134, getModelAvatar currently
hardcodes string checks for model names making it brittle; refactor by moving
the model-to-avatar mapping out of the function into a configurable data source
(e.g., a JSON config or a constant imported from a config module) and update
getModelAvatar to look up the lowercased model key in that mapping with a
sensible default fallback; specifically, create a mapping object (or load JSON)
keyed by normalized model identifiers, import/use it inside getModelAvatar,
replace the series of if checks with a single map lookup, and ensure the
function still returns undefined or a default avatar when no match is found.
app/composables/useNavigation.ts (1)
136-145: Inconsistent URL format: relative vs absolute paths in the same footer section.

Templates and Showcase use absolute URLs (https://nuxt.com/...) while the new AI Evals entry uses a relative path (/evals). Consider using consistent path formats within the same section.
♻️ Suggested fix
   }, {
     label: 'AI Evals',
-    to: '/evals'
+    to: 'https://nuxt.com/evals'
   }]
Or alternatively, make Templates and Showcase relative too.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@app/composables/useNavigation.ts` around lines 136 - 145, The footer
navigation "children" array in useNavigation.ts mixes absolute URLs ('Templates'
and 'Showcase' use https://nuxt.com/...) with a relative path for 'AI Evals'
('/evals'), causing inconsistent link behavior; update the entry for 'AI Evals'
(the object with label 'AI Evals' inside the children array) to use the same URL
format as the other items (e.g., change to 'https://nuxt.com/evals') or
alternatively make the 'Templates' and 'Showcase' entries relative to match
'/evals'—apply the change in the children array where these objects are defined.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@app/pages/evals.vue`:
- Around line 85-99: The computed allResults can divide by zero when evals is
empty; update the loop inside the allResults computed to guard against
evals.length === 0 by computing successRate only when evals.length > 0 (e.g.,
set successRate to 0 or a safe fallback), i.e. compute successes from evals, set
const denom = evals.length, then use denom ? Math.round((successes/denom)*100) :
0 when building the ModelRow object (referencing allResults, ModelRow, evals,
successes, and successRate).

---

Nitpick comments:
In `@app/composables/useNavigation.ts`:
- Around line 136-145: The footer navigation "children" array in
useNavigation.ts mixes absolute URLs ('Templates' and 'Showcase' use
https://nuxt.com/...) with a relative path for 'AI Evals' ('/evals'), causing
inconsistent link behavior; update the entry for 'AI Evals' (the object with
label 'AI Evals' inside the children array) to use the same URL format as the
other items (e.g., change to 'https://nuxt.com/evals') or alternatively make the
'Templates' and 'Showcase' entries relative to match '/evals'—apply the change
in the children array where these objects are defined.

In `@app/pages/evals.vue`:
- Around line 50-53: The $fetch call is making an external request via
joinURL(url, '/agent-results.json') during SSR; change the useAsyncData
invocation that fetches agent-results to call $fetch('/agent-results.json')
instead (e.g., inside the second useAsyncData for 'agent-results') so Nuxt
serves the public file without a network round-trip, and remove the now-unused
useSiteConfig() call and joinURL import if they are no longer referenced.
- Around line 122-134: getModelAvatar currently hardcodes string checks for
model names making it brittle; refactor by moving the model-to-avatar mapping
out of the function into a configurable data source (e.g., a JSON config or a
constant imported from a config module) and update getModelAvatar to look up the
lowercased model key in that mapping with a sensible default fallback;
specifically, create a mapping object (or load JSON) keyed by normalized model
identifiers, import/use it inside getModelAvatar, replace the series of if
checks with a single map lookup, and ensure the function still returns undefined
or a default avatar when no match is found.

In `@content.config.ts`:
- Around line 425-433: The evals collection schema currently validates githubUrl
with z.string(); update the schema in the defineCollection call for evals to use
z.string().url() for githubUrl so it matches other URL fields (demo, purchase,
website) and enforces proper URL validation; locate the evals defineCollection
and replace the githubUrl schema entry accordingly.

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@app/pages/evals.vue`:
- Around line 48-60: Replace the absolute fetch with a relative path and
propagate async errors before turning missing data into 404s: change the $fetch
call to use '/agent-results.json' instead of joinURL(url,
'/agent-results.json'), and destructure useAsyncData responses to include error
refs (e.g. const [{ data: page, error: pageError }, { data: rawData, error:
rawDataError }] = await Promise.all([...])). First check and throw the async
errors (throw pageError.value or rawDataError.value / createError with their
message/status) if present, then only after no errors check for missing
page.value or rawData.value and throw the 404 createError for not-found.

ℹ️ Review info

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e43b50f and 8d85102.

📒 Files selected for processing (4)

app/composables/useNavigation.ts
app/pages/evals.vue
content.config.ts
public/agent-results.json

✅ Files skipped from review due to trivial changes (1)

public/agent-results.json

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

app/pages/evals.vue (1)
239-239: Consider simplifying the guard condition.

Since rawData is now a static import (always defined at build time), the rawData check in v-if="page && rawData" is redundant. You could simplify to v-if="page".
♻️ Proposed simplification
-  <div v-if="page && rawData">
+  <div v-if="page">
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@app/pages/evals.vue` at line 239, The template guard currently uses
v-if="page && rawData" but rawData is a static import and always defined; update
the template to remove the redundant check by changing the conditional to
v-if="page" (modify the element that currently uses v-if="page && rawData" to
use only v-if="page"), ensuring no other logic depends on rawData being falsy.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@app/pages/evals.vue`:
- Around line 32-38: Remove the unused AgentResultsData interface declaration:
locate the interface named AgentResultsData and delete its full declaration
block, and verify there are no remaining references to AgentResultsData
elsewhere (if any, replace with the inferred/static type or the concrete type
from the imported JSON). Ensure the file compiles after removal.

---

Nitpick comments:
In `@app/pages/evals.vue`:
- Line 239: The template guard currently uses v-if="page && rawData" but rawData
is a static import and always defined; update the template to remove the
redundant check by changing the conditional to v-if="page" (modify the element
that currently uses v-if="page && rawData" to use only v-if="page"), ensuring no
other logic depends on rawData being falsy.

ℹ️ Review info

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 8d85102 and 8cee4b2.

📒 Files selected for processing (1)

app/pages/evals.vue

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

app/pages/evals.vue (1)

91-96: Reset expanded state when the agent filter changes.

Keeping old expansion state after filtering can leave expansion UI out of sync with the newly visible rows.

♻️ Suggested adjustment

-import { h, resolveComponent } from 'vue'
+import { h, resolveComponent, watch } from 'vue'
@@
-const expanded = ref({})
+const expanded = ref<Record<string, boolean>>({})
+
+watch(selectedAgents, () => {
+  expanded.value = {}
+})

Also applies to: 127-128

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@app/pages/evals.vue` around lines 91 - 96, filteredResults currently filters
allResults by selectedAgents but doesn't reset the UI expansion state, causing
stale expanded rows after filtering; add a watcher on selectedAgents (or
incorporate logic where filteredResults is computed) to clear the expansion
state ref (the ref that tracks expanded rows — e.g., expanded or expandedRows)
whenever selectedAgents changes, and apply the same pattern to the other
agent-filter computed at lines 127-128 so expansion is cleared any time the
agent filter is updated.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@app/pages/evals.vue`:
- Around line 99-103: The computed formattedDate can render "Invalid Date" for
malformed rawData.metadata.exportedAt; update formattedDate to validate the
parsed Date (use new Date(rawData.metadata.exportedAt) then check date.getTime()
or isNaN(date.getTime())/isFinite) and return an empty string (or fallback) when
the date is invalid before calling toLocaleDateString; reference the
formattedDate computed and rawData.metadata.exportedAt when making the change.

---

Nitpick comments:
In `@app/pages/evals.vue`:
- Around line 91-96: filteredResults currently filters allResults by
selectedAgents but doesn't reset the UI expansion state, causing stale expanded
rows after filtering; add a watcher on selectedAgents (or incorporate logic
where filteredResults is computed) to clear the expansion state ref (the ref
that tracks expanded rows — e.g., expanded or expandedRows) whenever
selectedAgents changes, and apply the same pattern to the other agent-filter
computed at lines 127-128 so expansion is cleared any time the agent filter is
updated.

ℹ️ Review info

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 8cee4b2 and e411a8f.

📒 Files selected for processing (1)

app/pages/evals.vue

feat(evals): add page

25c08da

vercel bot had a problem deploying to Preview February 12, 2026 18:01 Failure

fix lint

92fcd3d

vercel bot had a problem deploying to Preview February 12, 2026 18:06 Failure

benjamincanac added 2 commits February 12, 2026 22:27

chore: update agent results

8a357d9

feat(app): display avatar

c7175ef

vercel bot deployed to Preview February 12, 2026 21:34 View deployment

improve expanded button hover state

50b4f43

vercel bot deployed to Preview February 12, 2026 21:51 View deployment

feat: add link in footer

e43b50f

vercel bot deployed to Preview February 13, 2026 13:19 View deployment

benjamincanac marked this pull request as ready for review February 17, 2026 11:39

benjamincanac requested a review from atinux as a code owner February 17, 2026 11:39

coderabbitai bot reviewed Feb 17, 2026

View reviewed changes

Comment thread app/pages/evals.vue

benjamincanac marked this pull request as draft February 17, 2026 13:15

fix(evals): various improvements

29192e6

vercel bot deployed to Preview February 17, 2026 13:52 View deployment

feat: update agent results

e261731

vercel bot deployed to Preview February 18, 2026 11:45 View deployment

feat(evals): update agent results

8c08449

vercel bot had a problem deploying to Preview February 23, 2026 21:36 Failure

feat(evals): update

4c82241

vercel bot deployed to Preview February 24, 2026 22:12 View deployment

Merge branch 'main' into feat/evals

8d85102

benjamincanac marked this pull request as ready for review February 26, 2026 10:01

vercel bot had a problem deploying to Preview February 26, 2026 10:07 Failure

coderabbitai bot reviewed Feb 26, 2026

View reviewed changes

Comment thread app/pages/evals.vue Outdated

atinux added 2 commits February 26, 2026 11:22

chore: import result directly

8cee4b2

Update evals.vue

e411a8f

coderabbitai bot reviewed Feb 26, 2026

View reviewed changes

Comment thread app/pages/evals.vue Outdated

atinux approved these changes Feb 26, 2026

View reviewed changes

atinux merged commit 0b9aa30 into main Feb 26, 2026
6 of 8 checks passed

atinux deleted the feat/evals branch February 26, 2026 10:30

vercel bot deployed to Preview February 26, 2026 10:32 View deployment

coderabbitai bot reviewed Feb 26, 2026

View reviewed changes

Comment thread app/pages/evals.vue

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(evals): add page#2182

feat(evals): add page#2182
atinux merged 13 commits intomainfrom
feat/evals

benjamincanac commented Feb 12, 2026 •

edited

Loading

Uh oh!

vercel bot commented Feb 12, 2026 •

edited

Loading

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

benjamincanac commented Feb 12, 2026

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

coderabbitai bot commented Feb 17, 2026 •

edited

Loading

Walkthrough

Estimated code review effort

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

benjamincanac commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Linked issue

❓ Type of change

📚 Description

Uh oh!

vercel bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

benjamincanac commented Feb 12, 2026

Uh oh!

atinux commented Feb 12, 2026

Uh oh!

coderabbitai bot commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Estimated code review effort

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

benjamincanac commented Feb 12, 2026 •

edited

Loading

vercel bot commented Feb 12, 2026 •

edited

Loading

coderabbitai bot commented Feb 17, 2026 •

edited

Loading