Ingest streaming data to Apache Hudi tables using AWS Glue and Apache Hudi DeltaStreamer

You can use either Scala or PySpark to create AWS Glue Spark streaming ETL jobs that run continuously, consuming data from Amazon MSK, Apache Kafka, and Amazon Kinesis Data Streams and writing it to your
