Skip to main content

Community

Partitioning on fivetran synced column while syncing the data to Iceberg from MySQL

Answered

Please sign in to leave a comment.

Comments

1 comment

  • Official comment

    Thanks Megha for the request. 
    We have been looking at improvements on the read side improvements including partitioning. We are making improvements in how we sort data during Fivetran Syncs that should better sort data based on the Fivetran_synced value. Query engines like spark should then be able to use Parquet min/max values to prune at the file level.

     

    The intent is to make these changes transparent to make the overall management of your lake seamless. 

    Do you have data to show read side impact of a lack of partitioning? 
    If so, can we take the conversation offline. My email is casey.karst@fivetran.com

    -Casey