Skip to main content

Community

Connector Improvement: Update the Fivetran sync column only for records that have actually changed

Not planned

Please sign in to leave a comment.

Comments

2 comments

  • Official comment

    Hi Atul,

    Thank you for submitting this request. I understand the goal: you want the Fivetran sync column to update only for records that truly changed.

    Given file sources don't include a record-level modified timestamp, Fivetran's file connectors cannot offer this specific behavior without performance degradation. The hash you are referring to is sampling-based and not something we can use for this purpose.

    For true incremental behavior with file data, the most reliable options are:

    • Exporting incremental slices from the upstream system, if supported, or

    • Implementing destination-side post-processing that isolates the actual changes needed for downstream pipelines.

    Thanks,
    Parmeet

    Hi Parmeet,

    Thank you for your response.

    We are requesting this feature for the following reasons:

    • Fivetran already checks for changes at the primary key level, and if no change is detected, no update is performed, and it does not count toward MAR consumption.

    • Since Fivetran can already determine when a record has not changed, we’re wondering why the FIVETRAN_SYNCED column still needs to be updated in those cases. If this column is required to reflect the latest run, could we instead introduce another column—such as FIVETRAN_RECORD_MODIFIED—that is only updated when an actual change occurs in the record?

    Without this capability, we would be forced to compare millions of incoming records (we receive a full snapshot on each incremental run) with a target table containing tens of millions of rows to calculate deltas in Snowflake. This would significantly increase our Snowflake compute costs.

    Please let me know if you have any questions.

    Thanks,
    Manjeeth