EMR

  • Elastic Map Reduce
  • Use for
  • Big data processing
    • Hadoop Clusters
    • Apache Spark
  • Collect and process EC2 log files
  • Collect and process ALB log files
  • Supports
  • Apache Spark
  • HBase
  • Presto
  • Flink
  • To save cost, the EC2 instances are used to run in SPot Instances
  • When it comes to process lot of log files, EMR could be a go to move