Eval Library
F
For FirecrawlAI Platform

Scrape Single Url

Firecrawl · Firecrawl

Web Data for AI — Firecrawl

Firecrawl evals — Scrape (single URL) (relift v3 InfraRed)

About Firecrawl

Firecrawl is a web-data API for AI — it turns websites into clean, LLM-ready markdown or structured data via scrape, crawl, map, search, and LLM-powered extract endpoints, with JS rendering, browser actions, and proxies. Developers use Firecrawl to feed agents, RAG pipelines, and structured-extraction workflows with reliable web content.

Employees

~30

Industry

Web Data / Scraping

Headquarters

San Francisco, CA

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

Agent calls POST /v1/scrape for a docs page and needs only clean markdown for an LLM, but passes formats=['markdown','html','rawHtml','screenshot'] 'to be safe.'

Request only the formats actually consumed — formats=['markdown']. Each extra format (especially screenshot) adds latency and consumes additional credits. Read response.markdown from the per-format payload; do not request rawHtml/screenshot unless a downstream step uses them.

Pass / FailAi Platformmedium
02

A single-page app renders its main content after client-side hydration. A naive scrape returns an empty shell because the DOM was captured too early.

Set waitFor to a millisecond delay (or use actions/wait) so the page hydrates before capture; verify the returned markdown is non-empty. Tune waitFor to the smallest value that yields content — over-large waits inflate latency and credits.

Pass / FailAi Platformhigh
03

Agent needs a full-page visual capture of a long landing page but requests formats=['screenshot'] and gets only the above-the-fold viewport.

Use the full-page screenshot format (screenshot@fullPage) when the whole page is needed; plain screenshot captures the viewport only. Account for the extra credit cost of screenshots and store the returned image URL/asset deliberately.

Pass / FailAi Platformlow

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Firecrawl
  • Ai Platform
  • Scrape Single Url

Recommended for

FirecrawlFirecrawl customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.