When to Use
- Small tables — Tables with fewer than a few hundred thousand rows.
- Sources without incremental support — Some SaaS APIs or file sources only support full reads.
- Complete refresh — When you want the destination to exactly match the source state each run.
- Infrequent syncs — When the pipeline runs rarely (e.g. daily or weekly), full sync may be acceptable even for larger tables.
How It Works
- The pipeline connects to the source.
- It reads every row from the selected stream(s).
- It applies transformation rules (if any).
- It writes to the destination according to the write mode (append, replace, or upsert).
Limitations
- Performance — Every run processes the entire dataset. For large tables (millions of rows), this can be slow and resource-intensive.
- Source load — The source may be under heavy load during a full sync. Consider off-peak scheduling.
- Network — Large full syncs transfer more data than incremental syncs.