Skip to main content

Sync Reliability & Recovery

Vector Data Loader uses a job-based sync system with built-in retry logic, progress checkpointing, and automatic recovery. This means your sync operations are resilient to temporary failures, timeouts, and network issues.

How Sync Jobs Work

Every sync operation — whether triggered manually, via bulk resync, or by auto-sync — creates a sync job that processes documents in the background.

Job Lifecycle

StageWhat Happens
CreatedJob is created with a list of document IDs to process
RunningDocuments are processed one at a time, with progress saved after each
CheckpointedIf the job approaches a timeout, progress is saved and processing continues automatically
CompletedAll documents processed successfully
Completed with ErrorsSome documents succeeded, some failed
FailedAll documents failed to sync

Progress Tracking

When you start a sync, a Sync Progress Modal appears showing real-time progress:

  • Current document being processed
  • Success and failure counts
  • Overall completion percentage
  • Error details for any failed documents

You can close the modal and continue working — the sync continues in the background. Reopen the Documents page to see updated statuses.

tip

You can sync up to 20 documents at once. For 5 or fewer documents, sync begins immediately. For 6-20 documents, processing runs in the background.

Automatic Retry Logic

When a document fails to sync, the system classifies the error and decides whether to retry automatically.

Error Classification

CategoryExamplesRetried?Max Attempts
TransientTimeouts, rate limits (429), server errors (500/502/503), network failuresYes5
PermanentUnauthorized (401), forbidden (403), not found (404), insufficient creditsNo0
UnknownUnrecognized errors that don't match known patternsYes3

Transient Errors (Auto-Retried)

These are temporary issues that usually resolve on their own:

  • Connection timeouts — the source server was slow to respond
  • Rate limiting (429) — too many API requests in a short period
  • Server errors (500, 502, 503, 504) — the source service had a temporary outage
  • Network failures — brief connectivity interruption

The system uses exponential backoff between retry attempts, starting at 30 seconds and doubling each time (up to 1 hour maximum). A small random delay (jitter) is added to prevent multiple retries from colliding.

AttemptApproximate Wait
1st retry~30 seconds
2nd retry~1 minute
3rd retry~2 minutes
4th retry~4 minutes
5th retry~8 minutes

Permanent Errors (Not Retried)

These indicate a problem that won't resolve on its own:

  • Authentication expired — reconnect the source in Settings
  • Document deleted — the source document no longer exists
  • Insufficient credits — purchase more credits in Settings > Billing
  • Configuration missing — set up your embedding or vector store provider
warning

Permanent errors require you to fix the underlying issue (reconnect a source, add credits, etc.) and then manually re-sync the affected documents.

Crash Recovery

If a sync job is interrupted — for example, due to a server restart or function timeout — the system automatically recovers.

How Recovery Works

  1. Progress is checkpointed after each document, so completed work is never lost
  2. A background health check runs periodically to detect interrupted jobs
  3. Interrupted jobs are automatically resumed from where they left off
  4. Each job can be resumed up to 3 times before being marked as failed

What You'll See

  • Documents that were already synced before the interruption keep their "Synced" status
  • Remaining documents are retried automatically
  • If a job cannot be resumed, it's marked as failed and you can start a new sync
info

You never need to manually intervene for crash recovery. The system handles it automatically. If you see a job stuck in "Running" status for more than an hour, it will be cleaned up by the next health check.

Sync Health Dashboard

The Dashboard includes a Sync Health panel that gives you visibility into your sync reliability over the past 7 days:

MetricDescription
Job OutcomesCount of succeeded, partially succeeded, and failed sync jobs
Document Success RatePercentage of documents that synced successfully
Average DurationTypical time for a sync job to complete
Errors by SourceWhich connectors are experiencing the most errors
Top ErrorsMost frequent error messages across all sync jobs

Use this panel to spot patterns — for example, if a particular source connector is consistently failing, you may need to reconnect it or check its API credentials.

Limits & Constraints

LimitValuePurpose
Max documents per sync20Prevents oversized jobs
Max resume attempts3Prevents infinite retry loops
Stale job threshold1 hourJobs running longer are considered stuck
Max retry attempts (transient)5Generous retries for temporary issues
Max retry attempts (unknown)3Moderate retries for unclassified errors
One job at a timePer organizationPrevents resource contention

Best Practices

  1. Start small — When syncing a new source for the first time, sync a few documents first to verify your configuration works
  2. Check the Sync Health panel — After bulk operations, review the dashboard to spot any failures
  3. Fix permanent errors promptly — Expired OAuth tokens or missing API keys won't resolve on their own
  4. Don't re-sync during an active job — Wait for the current job to complete before starting another
  5. Use auto-sync for ongoing maintenance — Configure auto-sync in Settings to keep documents fresh automatically