Amazon Web Services (AWS) provides a dizzying array of cloud services, from the well known Elastic Compute Cloud (EC2) and Simple Storage Service (S3) to platform as a service (PaaS) offerings covering almost every aspect of modern computing.

Specifically, AWS provides a mature big data architecture with services covering the entire data processing pipeline — from ingestion through treatment and pre-processing, ETL, querying and analysis, to visualization and dashboarding. AWS lets you manage big data seamlessly and effortlessly, without having to set up complex infrastructure or deploy software solutions like Spark or Hadoop.

In this article I’ll cover five Amazon services, each covering an essential element of the modern data science workflow.

1. Amazon EMR

2. AWS Glue

3. Amazon SageMaker

4. Amazon Kinesis Video Streams

5. Amazon QuickSight

#big-data #aws #data-science 

top 5 AWS Services Every Data Scientist Needs To Know Right Away
1.35 GEEK