We have a number of very high-volume data sources. Processing these sources in a data warehouse is extremely expensive. We'd love to instead process and aggregate the data using something like Apache Spark, with the data sitting in Amazon S3, and then load the processed/aggregated dataset into our data warehouse.
To that end, it would be amazing if Fivetran could support an Apache Hudi destination on Amazon S3 (or the equivalent object storage on other cloud providers).
Bonus: Even more amazing if Fivetran could also sync that processed dataset (itself an Apache Hudi table on S3) into our data warehouse!