Athena

What is Athena

Interactive query service which enables you to analyse and query data located in S3 using standard SQL

- Serverless, nothing to provision, pay per query / per TB scanned. 
- No need to set up complex Extract/Transform/Load (ETL) processes.
                                    
What can Athena be used for?
- Can be used to query log files stored in S3, eg ELB logs, S3 access logs ect.
- Generate business reports on data stored in S3 and analyse AWS cost and Usage reports.
- Run queries on click-stream data that is stored in S3.
- Amazon Athena uses Presto with full standard SQL support and works with a variety of standard data formats, including  CSV, JSON, ORC, Apache Parquet and Avro.
- While Amazon Athena is ideal for quick, ad-hoc querying and integrates with Amazon QuickSight for easy visualization, it can also handle complex analysis, including large joins, window functions, and arrays.
                                    

Macie

What is PII (Personally Identifiable Information)
- Personal data used to establish an individual's identity.
- This data could be exploited by criminals, and used in identity theft and financial fraud.
- Home address, email address, SSN, Passport number, Drivers license number, DOB, phone number, bank account, credit card number.
                                    
What is Macie
Security service which uses Machine Learning and NLP (Natural Language Processing) to discover, classify and protect sensitive data stored in S3.
- Uses AI to recognise if your S3 objects contain data such as PII.
- Includes Dashboards, reporting and alerts and works directly with data stored in S3.
- Can also analyse CloudTrail logs for suspicious API activity.
- Great for PCI-DSS (credit card processing) and preventing ID theft.