
Actions And Dynamic Pages

Firecrawl evals — Actions & Dynamic Pages (relift v3 InfraRed)

About Firecrawl

Firecrawl is a web-data API for AI. It turns websites into clean, LLM-ready markdown or structured data through its scrape, crawl, map, search, and LLM-powered extract endpoints, with support for JS rendering, browser actions, and proxies. Developers use Firecrawl to feed agents, RAG pipelines, and structured-extraction workflows with reliable web content.

Employees: ~30
Industry: Web Data / Scraping
Headquarters: San Francisco, CA

Sample tests (showing 3 of 9)

Columns: #, Input, Expected behavior, Check

01
Input: Content sits behind a 'Load more' button; a plain scrape captures only the initial page state without the expanded content.
Expected behavior: Use an actions sequence with a click action targeting the button (plus a wait) so the dynamic content loads before capture. Verify the post-action markdown includes the expanded content rather than the initial state.
Check: Pass / Fail; AI Platform; high

02
Input: A page requires typing a query into a search box and submitting before the target results render.
Expected behavior: Use write/input actions to populate the field and a click/press action to submit, with waits between, so the results page renders before capture. Confirm the captured content reflects the submitted query.
Check: Pass / Fail; AI Platform; medium

03
Input: Agent needs a screenshot of a specific UI state reached only after a series of clicks, but only requests a top-level screenshot format.
Expected behavior: Place a screenshot action at the right point in the actions sequence to capture the intended interactive state. The format-level screenshot captures the final page; an in-sequence screenshot captures a specific step.
Check: Pass / Fail; AI Platform; low
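
The three sample tests above can be sketched as request payloads. The following is a minimal illustration, not Firecrawl's reference code. It only builds dictionaries and makes no network calls. The field names (`url`, `formats`, `actions`, and the action types `wait`, `click`, `write`, `press`, `screenshot`) follow one reading of Firecrawl's scrape API and should be checked against the current documentation. The helper names, URLs, selectors, and wait durations are hypothetical.

```python
from typing import Any, Dict, List

Action = Dict[str, Any]


def wait(ms: int) -> Action:
    """A fixed pause, giving dynamic content time to render."""
    return {"type": "wait", "milliseconds": ms}


def load_more_payload(url: str, button_selector: str) -> Dict[str, Any]:
    """Test 01: click the 'Load more' button, then wait before capture."""
    return {
        "url": url,
        "formats": ["markdown"],
        "actions": [
            wait(1000),  # let the initial page settle (assumed duration)
            {"type": "click", "selector": button_selector},
            wait(2000),  # let the expanded content load
        ],
    }


def search_payload(url: str, input_selector: str, query: str) -> Dict[str, Any]:
    """Test 02: focus the search box, type the query, submit, then wait."""
    return {
        "url": url,
        "formats": ["markdown"],
        "actions": [
            {"type": "click", "selector": input_selector},  # focus the field
            wait(500),
            {"type": "write", "text": query},
            wait(500),
            {"type": "press", "key": "ENTER"},  # submit the query
            wait(2000),  # let the results page render
        ],
    }


def screenshot_step_payload(
    url: str, click_selectors: List[str], capture_after: int
) -> Dict[str, Any]:
    """Test 03: take an in-sequence screenshot after the Nth click (1-based)."""
    if not 1 <= capture_after <= len(click_selectors):
        raise ValueError("capture_after must index one of the clicks")
    actions: List[Action] = []
    for step, selector in enumerate(click_selectors, start=1):
        actions.append({"type": "click", "selector": selector})
        actions.append(wait(1000))
        if step == capture_after:
            # Captures this intermediate state, not the final page.
            actions.append({"type": "screenshot"})
    return {"url": url, "formats": ["markdown"], "actions": actions}
```

Each payload would be sent as the JSON body of a scrape request. Keeping the screenshot inside `actions` rather than in `formats` is what separates the intermediate-state capture in test 03 from a final-page capture.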

How this eval is graded

Grade each response against expected.ideal_behavior and expected.rubric. A test passes when the mean of its per-criterion scores is at least 4.0 and no single criterion scores below 3.
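
The pass rule can be sketched as a small function. This is an illustration under assumptions, not the platform's grader: it takes the mean across the criteria of a single test, treats an empty score set as a failure, and uses hypothetical criterion names. The scoring scale is not stated in the source, so the function only applies the two thresholds.

```python
from statistics import mean
from typing import Dict


def passes(
    scores: Dict[str, float], min_mean: float = 4.0, min_score: float = 3.0
) -> bool:
    """Return True when the mean score is >= min_mean and no score is < min_score."""
    if not scores:
        return False  # assumption: nothing graded means no pass
    values = list(scores.values())
    return mean(values) >= min_mean and min(values) >= min_score


# Hypothetical criterion names, for illustration only.
example = {"uses_actions": 5, "waits_for_content": 4, "verifies_output": 3}
result = passes(example)
```

A single low criterion fails the test even when the mean clears 4.0, so a strong score elsewhere cannot offset a missed requirement.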

Rubric criteria

  • Firecrawl
  • AI Platform
  • Actions And Dynamic Pages

Recommended for

Firecrawl, Firecrawl customers
