IT data analytics Archives - Page 16 of 27 - Global Intelligence and Insight Platform: IT Innovation, ETF Investment, plus Health Wellbeing

Crawl Delta Lake tables using AWS Glue crawlers

September 13, 2022 GeneAka

You can grant Lake Formation permissions on the Delta tables created by the crawler to AWS principals that then query through Athena and Redshift Spectrum to access data in Delta tables. The AWS Glue crawler

Implement a highly available key distribution center for Amazon EMR

September 13, 2022 GeneAka

When creating an Amazon EMR security configuration, you’re asked to choose between a cluster-dedicated KDC or an external KDC, so it’s important to understand the benefits and limits of each solution. Considering the case in which the KDC is shared with other EMR clusters

Create single output files for recipe jobs using AWS Glue DataBrew

September 13, 2022 GeneAka

You can now choose single or multiple output files instead of autogenerated files for your DataBrew recipe jobs. In this post, we walk you through how to connect and transform data from an Amazon Simple

New additions to line charts in Amazon QuickSight

September 13, 2022 GeneAka

In such cases, instead of displaying a broken line chart that skips Sunday, you may want to show a continuous trend by directly connecting Saturday to Monday, hiding the fact that Sunday isn’t operational.

Integrate AWS IAM Identity Center (successor to AWS Single Sign-On) with AWS Lake Formation fine-grained access controls

September 13, 2022 GeneAka

Integrating Lake Formation with IAM Identity Center can help you manage data access at the organization level, consolidating AWS account and data lake authentication and authorization. When the permission sets are assigned to your data lake account, IAM Identity

8 Reasons to Build Your Cloud Data Lake on Snowflake

September 6, 2022 GeneAka

When you store data in Snowflake, your experience is drastically simplified because many storage management functionalities are handled automatically. Snowflake simplifies managing privileges to your

Interactively develop your AWS Glue streaming ETL jobs using AWS Glue Studio notebooks

September 6, 2022 GeneAka

This integration enables you to define data filters in Lake Formation that specify row-level and cell-level access control for users on your data and then query it using Redshift Spectrum. To solve this use case, we

Store Amazon EMR in-transit data encryption certificates using AWS Secrets Manager

September 6, 2022 GeneAka

In this post, I guide you through the configuration process and provide Java code samples to secure data in transit on Amazon EMR by storing TLS custom certificates using AWS Secrets Manager . The security

Interactively develop your AWS Glue streaming ETL jobs using AWS Glue Studio notebooks

September 6, 2022 GeneAka

An AWS Glue streaming ETL job consumes the data in near-real time and runs an aggregation that computes how many times a webpage has been unavailable (status code 500 and above) due to an internal error. In

Easy analytics and cost-optimization with Amazon Redshift Serverless

September 6, 2022 GeneAka

Amazon Redshift Serverless makes it easy to run and scale analytics in seconds without the need to setup and manage data warehouse clusters. To address these issues, they decide to let the data science team create

Use Amazon Redshift Spectrum with row-level and cell-level security policies defined in AWS Lake Formation

September 6, 2022 GeneAka

Convert Oracle XML BLOB data to JSON using Amazon EMR and load to Amazon Redshift

September 6, 2022 GeneAka

In this example, we use AWS DMS to extract data from an Oracle database with XML BLOB fields and stage the same data in Amazon Simple Storage Service (Amazon S3) in Apache Parquet format. After the

How Fresenius Medical Care aims to save dialysis patient lives using real-time predictive analytics on AWS

August 30, 2022 GeneAka

We needed to develop a near-real-time analytics solution that would collect dynamic dialysis machine data every 10 seconds during hemodialysis treatment in near-real time and personalize it to predict

Visualize Amazon S3 data using Amazon Athena and Amazon Managed Grafana

August 30, 2022 GeneAka

In this post, we show how you can create and configure a dashboard in Amazon Managed Grafana that queries data stored on Amazon S3 using Athena. The solution is comprised of a Grafana dashboard, created in

« 1 … 14 15 16 17 18 … 27 »