This article describes the design process, principles, and technology choices for using Azure Synapse to build a secure data lakehouse solution. Serverless SQL pool, Apache Spark in Azure Synapse, Azure
This article describes the design process, principles, and technology choices for using Azure Synapse to build a secure data lakehouse solution. Serverless SQL pool, Apache Spark in Azure Synapse, Azure
AWS Lake Formation allows you to define and enforce access policies at the database, table, and column level when using Athena queries to read data stored in Amazon S3. In this post, we show you how you can use
This post describes how to create multilingual dashboards at the data level by creating new columns that contain the translated text and providing a language selection parameter and associated control to
In this post, we create an end-to-end pipeline to ingest, store, process, analyze, and visualize operational data like orders, inventory, and shipment updates. In this post, we demonstrate how to create a
Enterprise IT pros tasked with shoring up resiliency among Kubernetes multi-cluster and multi-cloud environments favored open source service mesh projects Linkerd and Kuma over Istio. Kubernetes can handle some
The following high-level architecture diagram shows the components to integrate Schema Registry and the Data Catalog to run streaming ETL jobs. In this post, we demonstrate how to integrate Schema Registry with
In this post, we look at how we supercharged our data highway, the backbone of our major analytics pipeline, by migrating our Amazon Redshift clusters to RA3 nodes. After discussions with AWS experts and
Terraform allows teams to manage their Databricks Runtimes as needed in different environments, while all libraries are now stored as Azure Artifacts, avoiding stale package versions. This blog describes the way we developed our data platform DR scenario, guaranteeing RTOs and
In this post, you will learn how to use Amazon Athena Federated Query to connect a MongoDB database to Amazon QuickSight in order to build dashboards and visualizations. Athena uses data source connectors that
In this post, we go through the steps to configure federated single sign-on (SSO) between a Google Workspace instance and QuickSight account. If the SAML authentication response includes attributes that map to multiple AWS Identity and Access Management (IAM) roles, the user is
Athena now supports querying and creating Ion-formatted datasets via an Ion-specific SerDe, which in conjunction with and allows you to read and write valid Ion data. Let’s run a query that specifies the from our example row of Ion data to verify that we can read from the table:
QuickSight automatically optimizes queries and execution to help dashboards load quickly, but you can make your dashboard loads even faster and make sure you’re getting the best possible performance by
Set up two IAM roles: one that establishes a trust relationship between your IdP and AWS, and a second role that Okta uses to access Amazon Redshift. You also create an IAM role that Okta uses to access Amazon
We describe how DNS names, Kerberos realms, and AD domains are different, and the consequences of that for Amazon EMR security configuration and cluster one-way trust settings. Many of our customers