Your First Extraction

Extract structured data from a website in under 5 minutes

This guide walks you through a complete extraction from start to finish.

Prerequisites

  • An Ev3ry account
  • A website URL you want to extract from

1. Add a website

From the dashboard, click Add Website. Enter:

  • URL — the page the agent should start from
  • Name — a label for your own reference
  • Description (optional) — describe what you want, e.g. "Extract upcoming sports events including name, date, venue, and ticket price"

The description guides the agent during exploration. The more specific, the better.

2. Create a schema

Go to your website's detail page and click New Schema (or open the global Schemas library to reuse an existing one).

Define the JSON Schema for the data you want. For a first extraction, a simple object schema works well:

{
  "type": "array",
  "items": {
    "type": "object",
    "properties": {
      "event_name": { "type": "string" },
      "date":       { "type": "string" },
      "venue":      { "type": "string" },
      "price":      { "type": "number" }
    }
  }
}

Tip: Start with 3–5 fields. You can always refine after seeing what the agent returns.

3. Start a run

Click Run. Choose your browser connection method:

  • Built-in browser — runs locally on your machine, best for sensitive data or anti-bot protection
  • Cloud browser — no setup needed if you have an API key configured in Settings

The agent opens a browser and begins working. The live view shows you the browser in real time — you can watch it navigate, click, and extract.

4. Review the results

When the run completes, the extracted data appears as a table. From here:

  • Inspect rows for accuracy
  • Download as JSON or CSV
  • Save as a playbook for future reuse

Troubleshooting

Empty results — the page may load data dynamically. Add "scroll down to load all content" to the website description and re-run.

Wrong data — refine the schema field names and add a description to each property, e.g. "description": "the full event name including series title". The agent uses these hints.

Login wall — the agent landed on a login page. Add a saved login under the website's settings. See Authenticated Pages.

Slow run — first runs explore the page and write extraction scripts. Once you save a playbook, subsequent runs are significantly faster.