Systems and Platforms for Big Data Statistical Analysis
Preface
1.
Chapter 1 Introduction
2.
Chapter 2 Computing Infrastructures
3.
Chapter 3 Clusters
3.1.
Multicore Computing
3.2.
Virtual Machine Clusters
3.2.1.
Create A VM Node
3.2.2.
Create Additional VM Nodes
3.3.
Purdue Cluster
3.4.
Amazon Web Service
4.
Chapter 4 R for Statistical Analysis
5.
Chapter 5 Parallel Computing in R
5.1.
Explicit Parallelization
5.2.
Implicit Parallelization
5.2.1.
Using OpenMP
5.3.
Combined Approach
6.
Chapter 6 Bridge R and Big Data Platforms
6.1.
Tessera
6.2.
SparkR
7.
Chapter 7 Native Distributed Computing in R
8.
Appendix
8.1.
Install OpenJDK 8 on Ubuntu Server 14.04
8.2.
Oracle VirtualBox Networking
Powered by
GitBook
Systems and Platforms for Big Data Statistical Analysis
Combined Approach