Moongraph

Extraction Reference

Complete reference for extraction schemas, runs, and limits.

This page documents all extraction options, field types, status values, and system limits.

Field types

| Type | Description | Example or note |
| --- | --- | --- |
| Text | Free-form text of any length | Names, descriptions, quotes |
| Number | Integer or decimal values | 2024, 3.14, -5 |
| Boolean | True/false value | Displays as "Yes" or "No" |
| Enum | One choice from a predefined list | Must define at least 2 values |
| Object | Group of related sub-fields | Address with city, state, zip |
| Array | List of multiple values | ["value1", "value2"] |

Field options

| Option | Description |
| --- | --- |
| Required | Field must have a value (empty not allowed) |
| Description | Field-specific guidance for the AI |

Schema visibility

| Visibility | Who can see | Who can edit |
| --- | --- | --- |
| Private | Only you | Only you |
| Shared | Your team | Only you |
| Public | All users | Only you |
| Template | All users | Administrators only |

Template schemas cannot be edited or deleted, only cloned.

Run status values

| Status | Description | Actions available |
| --- | --- | --- |
| Queued | Waiting to start processing | Cancel |
| Running | Processing documents (shows X/Y progress) | Cancel |
| Complete | Successfully finished | View results, Export CSV, Delete |
| Failed | Error occurred during processing | View error, Delete |
| Cancelled | Stopped by user | View partial results, Delete |
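
The status table can be mirrored in client code as a simple lookup. The following is a minimal sketch, not part of any Moongraph API: the names `RunStatus`, `AVAILABLE_ACTIONS`, and `can_perform` are hypothetical, and only the status names and actions come from the table.

```python
from enum import Enum


class RunStatus(Enum):
    """Run status values as listed in the table (class name is illustrative)."""
    QUEUED = "Queued"
    RUNNING = "Running"
    COMPLETE = "Complete"
    FAILED = "Failed"
    CANCELLED = "Cancelled"


# Actions available in each status, copied from the table.
AVAILABLE_ACTIONS = {
    RunStatus.QUEUED: ("Cancel",),
    RunStatus.RUNNING: ("Cancel",),
    RunStatus.COMPLETE: ("View results", "Export CSV", "Delete"),
    RunStatus.FAILED: ("View error", "Delete"),
    RunStatus.CANCELLED: ("View partial results", "Delete"),
}


def can_perform(status, action):
    """Return True if the action is listed for the given status."""
    return action in AVAILABLE_ACTIONS[status]
```

For example, `can_perform(RunStatus.RUNNING, "Export CSV")` is false under this mapping, because CSV export is listed only for completed runs.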

Results table columns

| Column | Description | Default visible |
| --- | --- | --- |
| Document | Source document name | Yes |
| [Schema fields] | One column per field | Yes |
| Source Quote | Text passage used for extraction | Yes |
| Chunk | Section/chunk identifier | No |

Schema validation rules

| Rule | Error message |
| --- | --- |
| Field name required | "Field name cannot be empty" |
| No reserved prefixes | "Field name cannot start with '_' or '$'" |
| Reserved names blocked | "[name] is a reserved field name" |
| Enum minimum | "Enum must have at least 2 values" |
| Enum uniqueness | "Duplicate enum value" |

Reserved field names

The following names cannot be used as field names: id, created_at, updated_at, _provenance, _meta
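
The validation rules and reserved names above can be checked locally before saving a schema. This is a minimal sketch under stated assumptions: `validate_field` and its parameters are hypothetical names, the comparison is assumed to be case-sensitive, and the order in which the checks run (reserved name before reserved prefix) is a guess. Only the error messages and the reserved list come from this page.

```python
RESERVED_NAMES = {"id", "created_at", "updated_at", "_provenance", "_meta"}
RESERVED_PREFIXES = ("_", "$")


def validate_field(name, field_type="Text", enum_values=None):
    """Return the list of error messages for one field definition."""
    errors = []
    if not name or not name.strip():
        errors.append("Field name cannot be empty")
        return errors
    # Assumption: reserved names are reported before the prefix rule.
    if name in RESERVED_NAMES:
        errors.append(f"{name} is a reserved field name")
    elif name.startswith(RESERVED_PREFIXES):
        errors.append("Field name cannot start with '_' or '$'")
    if field_type == "Enum":
        values = enum_values or []
        if len(values) < 2:
            errors.append("Enum must have at least 2 values")
        if len(set(values)) != len(values):
            errors.append("Duplicate enum value")
    return errors
```

An empty list means the field passes every rule listed here; the service itself may apply additional checks.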

CSV export format

| Data type | CSV representation |
| --- | --- |
| Text | Plain text |
| Number | Numeric value |
| Boolean | true or false |
| Enum | Selected value |
| Array | JSON string: ["val1","val2"] |
| Object | Flattened: Field.Subfield |
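
To illustrate the mapping above, the sketch below flattens one record into CSV-ready columns. It is not the exporter's actual code: `flatten_record` and `to_csv` are hypothetical names, the input is assumed to be a plain Python dict, and writing missing values as empty strings is an assumption this page does not confirm.

```python
import csv
import io
import json


def flatten_record(record, prefix=""):
    """Flatten one extracted record into column/value pairs."""
    row = {}
    for name, value in record.items():
        column = f"{prefix}{name}"
        if isinstance(value, dict):
            # Object: flatten sub-fields into Field.Subfield columns.
            row.update(flatten_record(value, prefix=f"{column}."))
        elif isinstance(value, bool):
            # Boolean: checked before numbers because bool is an int subclass.
            row[column] = "true" if value else "false"
        elif isinstance(value, list):
            # Array: compact JSON string such as ["val1","val2"].
            row[column] = json.dumps(value, separators=(",", ":"))
        elif value is None:
            row[column] = ""  # assumption: empty cell for missing values
        else:
            row[column] = value  # Text, Number, and Enum pass through
    return row


def to_csv(records):
    """Write flattened records to a CSV string, columns in first-seen order."""
    rows = [flatten_record(r) for r in records]
    columns = []
    for row in rows:
        for column in row:
            if column not in columns:
                columns.append(column)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Because array cells contain commas and double quotes, the `csv` module quotes and escapes them when writing, so a spreadsheet or parser reads each array back as a single cell.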

System limits

| Limit | Value |
| --- | --- |
| Documents per folder extraction | 50 |
| Items extracted per document | 5,000 |
| Fields per schema | 50 |
| Schema nesting depth | 5 levels |
| Schema definition size | 64 KB |
| Minimum enum values | 2 |
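
The schema-related limits can be pre-checked before submitting a schema. The sketch below rests on several assumptions that this page does not confirm: the schema is represented as a list of dicts with an optional `fields` key for Object sub-fields (a hypothetical layout), nested sub-fields count toward the 50-field limit, top-level fields sit at depth 1, and 64 KB means 65,536 bytes of UTF-8 encoded JSON.

```python
import json

MAX_FIELDS = 50
MAX_DEPTH = 5
MAX_SCHEMA_BYTES = 64 * 1024  # assumption: 1 KB = 1,024 bytes


def count_fields(fields):
    """Count every field, including nested sub-fields (an assumption)."""
    return sum(1 + count_fields(f.get("fields", [])) for f in fields)


def nesting_depth(fields):
    """Return the deepest nesting level; top-level fields are depth 1."""
    if not fields:
        return 0
    return 1 + max(nesting_depth(f.get("fields", [])) for f in fields)


def check_limits(fields):
    """Return a list of human-readable limit violations (empty if none)."""
    problems = []
    total = count_fields(fields)
    if total > MAX_FIELDS:
        problems.append(f"{total} fields exceeds the limit of {MAX_FIELDS}")
    depth = nesting_depth(fields)
    if depth > MAX_DEPTH:
        problems.append(f"nesting depth {depth} exceeds {MAX_DEPTH} levels")
    size = len(json.dumps(fields).encode("utf-8"))
    if size > MAX_SCHEMA_BYTES:
        problems.append(f"definition is {size} bytes, over {MAX_SCHEMA_BYTES}")
    return problems
```

The service may count fields or measure definition size differently, so treat a clean result here as a rough guide only.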

Processing speed

Extraction processes approximately 10 pages per minute.

| Document size | Estimated time |
| --- | --- |
| 100 pages | ~10 minutes |
| 500 pages | ~50 minutes |
| 1,000 pages | ~100 minutes |
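
The estimates above follow directly from dividing the page count by the approximate rate. As a small sketch (the function name `estimate_minutes` is illustrative, and the 10 pages per minute figure is only the approximation stated on this page):

```python
PAGES_PER_MINUTE = 10  # approximate rate stated on this page


def estimate_minutes(page_count):
    """Estimate extraction time in minutes for a given number of pages."""
    return page_count / PAGES_PER_MINUTE
```

For instance, `estimate_minutes(500)` gives 50.0, matching the second example. Actual times vary with document size and schema complexity.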

Troubleshooting

Schema validation errors

| Error | Solution |
| --- | --- |
| "Field name cannot be empty" | Enter a name for the field |
| "Field name cannot start with '_' or '$'" | Rename the field so it does not start with an underscore or dollar sign |
| "Enum must have at least 2 values" | Add more values or change the type to Text |
| "Duplicate enum value" | Remove the duplicate |

Run failures

| Cause | Solution |
| --- | --- |
| Network/server issues | Try starting a new run |
| Corrupted documents | Remove problem documents |
| Very large documents | Split documents or increase timeout |

Missing/empty fields

| Cause | Solution |
| --- | --- |
| Information not in document | Expected behavior |
| Unclear field name | Make the name more specific |
| Information in unexpected format | Add extraction instructions |

Slow extraction

Processing time depends on document size and schema complexity, so longer runs are normal for large documents with many fields.
