by Angela Guess
A recent press release reports, “Today, Amazon Web Services, Inc. (AWS), an Amazon.com company, launched AWS Glue, a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data into Amazon Simple Storage Service (Amazon S3), Amazon Redshift, Amazon Relational Database Service (Amazon RDS), and databases running on Amazon Elastic Compute Cloud (Amazon EC2) for query and analysis. Customers can create and run an ETL job with a few clicks in the AWS Management Console.”
The release goes on, “Customers simply point AWS Glue at their data stored on AWS, and AWS Glue discovers the associated metadata (e.g. table definitions) and classifies it, generates ETL scripts for data transformation, and loads the transformed data into a destination data store, provisioning the infrastructure needed to complete the job. With AWS Glue, data can be available for analysis in minutes, and because AWS Glue is serverless, customers only pay for the compute resources they consume while executing data preparation and loading jobs. To learn more about AWS Glue, visit https://aws.amazon.com/glue.”
It continues, “Data integration – extracting data from various sources, normalizing it, and loading it into data stores – often represents as much as 75 percent of the time required to implement an analytics project. Customers can spend months hand coding and editing ETL scripts, which frequently become more complex and error prone as data volumes grow, and new data sources are added. And, running ETL jobs requires dedicated hardware that often sits idle between jobs. AWS Glue significantly speeds the ETL phase of analytics projects by eliminating all of the undifferentiated heavy lifting involved in creating, managing, and modifying ETL jobs.”
Read more at Business Wire.
Photo credit: AWS