Run Extraction
Extract structured data from selected documents.
Run Extraction
Once you have a schema, run extraction on documents to produce structured results.
Steps
- Navigate to Extract in the sidebar
- Click New Run in the top-right corner
- Select a schema from the dropdown
- Select documents (see options below)
- Click Start Extraction
You'll be redirected to the run detail page where results appear as processing completes.
Selecting documents
Option A: Individual documents
- Click the documents dropdown
- Check boxes next to documents to include
- Select as many as needed
Option B: Select a folder
- Click the folder dropdown
- Select a folder (shows document count)
- Confirm in the dialog
Note: You can only use one option. To switch, clear your current selection.
Folder limits: Folders must have 1-50 documents. For larger collections, split across multiple runs.
Shortcut from Files
- Go to Files
- Select documents using checkboxes
- Click Extract in the toolbar
- You'll be taken to New Extraction with documents pre-selected
Monitoring progress
The run detail page shows:
| Status | Meaning |
|---|---|
| Queued | Waiting to start |
| Running (X/Y) | X of Y documents completed |
| Complete | Finished successfully |
| Failed | Error occurred |
| Cancelled | You stopped the run |
The page auto-refreshes every 5 seconds while processing.
Canceling a run
- Click the actions menu (⋮)
- Select Cancel
The run stops after the current document finishes. Partial results are preserved.
Viewing results
When complete, you'll see a table with:
- Document — Source document name
- Schema fields — One column per field
- Source Quote — The text passage used for extraction
- Chunk — Which section it came from (hidden by default)
Column visibility
Click Columns to show/hide columns.
Understanding values
| Type | Display |
|---|---|
| Text, Number | Value as-is |
| Boolean | "Yes" or "No" |
| Enum | Matched value from your list |
| Array | JSON list like ["value1", "value2"] |
| Object | Dot notation columns (Address.City, Address.State) |
| Missing | "—" |
Exporting results
- Click Export CSV (appears when run completes)
- Download opens automatically
The CSV includes all fields plus provenance columns.
Opening exports
Compatible with Excel, Google Sheets, Numbers, or any data tool.
Processing time
Extraction processes roughly 10 pages per minute.
| Documents | Pages each | Total pages | Estimated time |
|---|---|---|---|
| 10 | 20 | 200 | ~20 minutes |
| 50 | 20 | 1,000 | ~100 minutes |
| 50 | 50 | 2,500 | ~4 hours |
Large runs can be left running — check back later.