AWS Glue is a serverless data integration service that makes it easy to find, prepare, combine data for analytics, machine learning, and application development.
Data integration refers to the preparation process of data for analysis, machine learning, application development. It involves several tasks, such as finding and extracting the data from multiple sources, enriching, cleaning, normalizing, combining the data. Of course, also the loading and organizing of the data in databases, data warehouses, data lakes. These tasks are often performed by different types of users who use different products.
AWS Glue provides both visual and code-based interfaces to facilitate data integration. Users can easily find and access data using the AWS Glue Data Catalog. Data engineers, Extract, Transform, Load (ETL) developers can use AWS Glue Studio to visually create, run, and monitor ETL workflows in just a few clicks. Data analysts and data scientists use AWS Glue DataBrew to visually enrich, cleanse, and normalize data without writing code. With AWS Glue Elastic Views, application developers use the familiar Structured Query Language (SQL) to combine and replicate data from different stores.