IT data analytics Archives - Page 20 of 27 - Global Intelligence and Insight Platform: IT Innovation, ETF Investment, plus Health Wellbeing

Best Practices for Data Ingestion with Snowflake: Part 1

June 28, 2022 GeneAka

Alongside our extensive ecosystem of ETL and data ingestion partners who help move data into the Data Cloud, Snowflake offers a wide range of first party methods to meet the different data pipeline needs from batch

Implement a CDC-based UPSERT in a data lake using Apache Iceberg and AWS Glue

June 21, 2022 GeneAka

To solve this use case, we present the following simple architecture that integrates Amazon S3 for the data lake, AWS Glue with the Apache Iceberg connector for ETL (extract, transform, and load), and Athena for

Build an Apache Iceberg data lake using Amazon Athena, Amazon EMR, and AWS Glue

June 21, 2022 GeneAka

In this post, we show you how to use Amazon EMR Spark to create an Iceberg table, load sample books review data, and use Athena to query, perform schema evolution, row-level update and delete, and time travel,

Secure a data lakehouse on Synapse

June 21, 2022 GeneAka

This article describes the design process, principles, and technology choices for using Azure Synapse to build a secure data lakehouse solution. Serverless SQL pool, Apache Spark in Azure Synapse, Azure

Use an AD FS user and Tableau to securely query data in AWS Lake Formation

June 14, 2022 GeneAka

AWS Lake Formation allows you to define and enforce access policies at the database, table, and column level when using Athena queries to read data stored in Amazon S3. In this post, we show you how you can use

Build a multilingual dashboard with Amazon Athena and Amazon QuickSight

June 14, 2022 GeneAka

This post describes how to create multilingual dashboards at the data level by creating new columns that contain the translated text and providing a language selection parameter and associated control to

A serverless operational data lake for retail with AWS Glue, Amazon Kinesis Data Streams, Amazon DynamoDB, and Amazon QuickSight

June 7, 2022 GeneAka

In this post, we create an end-to-end pipeline to ingest, store, process, analyze, and visualize operational data like orders, inventory, and shipment updates. In this post, we demonstrate how to create a

Kubernetes multi-cluster users tap service mesh alternatives

June 7, 2022 GeneAka

Enterprise IT pros tasked with shoring up resiliency among Kubernetes multi-cluster and multi-cloud environments favored open source service mesh projects Linkerd and Kuma over Istio. Kubernetes can handle some

Integrate AWS Glue Schema Registry with the AWS Glue Data Catalog to enable effective schema enforcement in streaming analytics use cases

June 7, 2022 GeneAka

The following high-level architecture diagram shows the components to integrate Schema Registry and the Data Catalog to run streaming ETL jobs. In this post, we demonstrate how to integrate Schema Registry with

Supercharging Dream11’s Data Highway with Amazon Redshift RA3 clusters

June 7, 2022 GeneAka

In this post, we look at how we supercharged our data highway, the backbone of our major analytics pipeline, by migrating our Amazon Redshift clusters to RA3 nodes. After discussions with AWS experts and

How illimity Bank Built a Disaster Recovery Strategy on the Lakehouse

May 31, 2022 GeneAka

Terraform allows teams to manage their Databricks Runtimes as needed in different environments, while all libraries are now stored as Azure Artifacts, avoiding stale package versions. This blog describes the way we developed our data platform DR scenario, guaranteeing RTOs and

Visualize MongoDB data from Amazon QuickSight using Amazon Athena Federated Query

May 31, 2022 GeneAka

In this post, you will learn how to use Amazon Athena Federated Query to connect a MongoDB database to Amazon QuickSight in order to build dashboards and visualizations. Athena uses data source connectors that

Enable Amazon QuickSight federation with Google Workspace

May 31, 2022 GeneAka

In this post, we go through the steps to configure federated single sign-on (SSO) between a Google Workspace instance and QuickSight account. If the SAML authentication response includes attributes that map to multiple AWS Identity and Access Management (IAM) roles, the user is

Analyze Amazon Ion datasets using Amazon Athena

May 24, 2022 GeneAka

Athena now supports querying and creating Ion-formatted datasets via an Ion-specific SerDe, which in conjunction with and allows you to read and write valid Ion data. Let’s run a query that specifies the from our example row of Ion data to verify that we can read from the table:

« 1 … 18 19 20 21 22 … 27 »