Run the benchmarks on the data that is stored. We set up the cluster in AWS in the following way: An instance to run Spark with a Yarn Hadoop cluster in pseudo-distributed mode. In this configuration, ...