black screen with code

Orchestrating analytics jobs by running Amazon EMR Notebooks programmatically

Amazon EMR is a big data service offered by AWS to run Apache Spark and other open-source applications on AWS in a cost-effective manner. Amazon EMR Notebooks is a managed environment based on Jupyter Notebook that allows data scientists, analysts, and developers to prepare and visualize data, collaborate with peers, build applications, and perform interactive analysis using EMR clusters.

Continue reading

application blur business code

Optimizing Spark applications with workload partitioning in AWS Glue

AWS Glue provides a serverless environment to prepare (extract and transform) and load large amounts of datasets from a variety of sources for analytics and data processing with Apache Spark ETL jobs. This posts discusses a new AWS Glue Spark runtime optimization that helps developers of Apache Spark applications and ETL jobs, big data architects, data engineers, and business analysts scale their data processing and batch jobs running on AWS Glue automatically.

Continue reading

Is Augmented Analytics Making the Difference It Advertises?

Although the augmented analytics vendors are continuously adding new capabilities to their products, there are some fundamental things that enterprises need to get right.
Augmented analytics tools also come with pre-built machine learning models to empower any user to do single click forecasts, identify trends and trend reversals, anomalies, outliers — tasks that in the past required involvement from professional data scientists.

Continue reading

Top 10 Data and Analytics Trends for 2021

Enterprise organizations have embraced the ideas behind advanced analytics technologies over the past several years, beginning with buzz words like big data and moving onto topics such as machine learning and artificial intelligence.
With that in mind, during its recent Gartner IT Symposium , the analyst firm unveiled its Top 10 Strategic Technology Trends in Data and Analytics, 2020, a list designed to take organizations “from crisis to opportunity,” as enterprises recover from the effects of the pandemic on business and IT initiatives.

Continue reading

job applicant passing her documents

IT Employment Looks Up; Data, Cybersecurity Skills in Demand

IT Employment Trending Up; Data, Cybersecurity Skills in Demand There’s a light at the end of the tunnel for IT pros who have been keeping a close eye on the job market.
The four employment metrics tracked by CompTIA are IT sector (tech company) employment, IT occupation employment (IT jobs across all industry sectors), the unemployment rate for IT occupations, and employer job posting for new IT hires.

Continue reading

creative internet computer display

The democratization of insights: Empowering data analysts and business users

A new set of big data tools (spurred by the release of academic papers describing Google’s internal technology) gave data engineering experts the ability to collect and store this new data, making it available to expert users who could generate insights.
Unfortunately, even with this new data made available and accessible, most business users didn’t have the skills to generate insights

Continue reading

Weka announces cloud-native, unified storage solutions for the entire data lifecycle

Weka has developed reference architectures (RAs) with leading object storage technology alliances, like Amazon Web Services (AWS), Cloudian, IBM, Seagate, Quantum, Scality , and others in Weka’s Technology Alliance Program, to deliver cost-efficient, cloud-native data storage solutions at any scale.
WekaFS provides the ease of managing petabytes of data in a single, unified namespace wherever in the pipeline the data is stored, while also delivering the best performance to accelerate artificial intelligence/machine learning (AI/ML), genomics research, high-performance computing (HPC), and high-performance data analytics (HPDA) workflows

Continue reading

1 37 38 39 40