Usage/Deployment Instructions
Step 1: Open PuTTY to connect over SSH
Step 2: In PuTTY, type <instance public IP> in the “Host Name” field

Step 3: Open the Connection->SSH->Auth tab in the left-hand pane
Step 4: Click the Browse button, select the PPK file for the instance, and then click Open

Step 5: Type "ubuntu" as the user name. No password is needed; authentication uses the key from the PPK file.

Step 5.1: If Ubuntu reports that updates are available, run these commands:
$ sudo apt-get update

$ sudo apt-get upgrade

Step 6: Use the following Linux commands to start Hadoop
Step 6.1: $ sudo vi /etc/hosts
Take the private IP address of your machine, as shown in the screenshot below, and update the second line of the file shown in the editor with that private IP address.
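The manual edit in Step 6.1 could also be scripted. The following is a minimal sketch, not part of the original procedure: the function name set_host_ip is hypothetical, and it assumes that "the second line" means the address field at the start of line 2 of a hosts-style file. It rewrites whatever file it is given, so it can be tried on a copy first.

```shell
# Hypothetical helper: replace the leading address field on line 2 of a
# hosts-style file with the given private IP address.
# Usage: set_host_ip <file> <private-ip>
set_host_ip() {
    file=$1
    ip=$2
    tmp=$(mktemp) || return 1
    # Only line 2 is touched; the host names after the address are kept.
    if sed "2s/^[^[:space:]]*/$ip/" "$file" > "$tmp"; then
        # Write back with cat so the file keeps its owner and permissions.
        cat "$tmp" > "$file"
    fi
    rm -f "$tmp"
}

# Example (assumption: on Ubuntu, `hostname -I` lists the instance addresses,
# with the private IP first on a single-interface instance):
#   cp /etc/hosts /tmp/hosts.new
#   set_host_ip /tmp/hosts.new "$(hostname -I | awk '{print $1}')"
#   sudo cp /tmp/hosts.new /etc/hosts
```

Working on a copy and then copying it back with sudo keeps the original /etc/hosts untouched until the result has been inspected.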


Step 6.2: $ ssh-keygen -t rsa -P ""
This command generates an RSA SSH key pair with an empty passphrase.

Step 6.3: $ cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys
This command appends the generated public key to the authorized_keys file, so that SSH logins to localhost do not ask for a password.
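Running Step 6.3 more than once appends the same key again each time. A minimal sketch of a repeat-safe variant is shown here; the function name authorize_key is hypothetical and is not part of the original procedure.

```shell
# Hypothetical helper: append a public key to an authorized_keys file only
# if that exact key line is not already present.
# Usage: authorize_key <public-key-file> <authorized-keys-file>
authorize_key() {
    pub=$1
    auth=$2
    mkdir -p "$(dirname "$auth")"
    touch "$auth"
    key=$(cat "$pub")
    # -x matches the whole line, -F treats the key as a fixed string.
    grep -qxF -- "$key" "$auth" || printf '%s\n' "$key" >> "$auth"
    # sshd may ignore an authorized_keys file that others can write to.
    chmod 600 "$auth"
}

# Example, after the key pair from Step 6.2 exists:
#   authorize_key "$HOME/.ssh/id_rsa.pub" "$HOME/.ssh/authorized_keys"
```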

Step 6.4: $ ssh localhost

Step 6.5: $ hdfs namenode -format
Type "yes" when you are prompted: Are you sure you want to continue?

Step 6.6: $ start-all.sh
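One way to confirm that the daemons came up is to inspect the output of jps, the JDK tool that lists running Java processes. The sketch below is an assumption-laden illustration: the function name missing_daemons is hypothetical, and the daemon list in the example is the usual set for a single-node Hadoop 2.x setup, which may differ on this image.

```shell
# Hypothetical helper: read `jps` output on stdin and print every expected
# daemon name (given as arguments) that is not running.
missing_daemons() {
    running=$(cat)
    for name in "$@"; do
        # jps prints "<pid> <ClassName>"; compare the class name as a whole
        # field so that NameNode does not match SecondaryNameNode.
        printf '%s\n' "$running" |
            awk -v n="$name" '$2 == n { found = 1 } END { exit !found }' ||
            printf '%s\n' "$name"
    done
}

# Example (no output means every listed daemon is running):
#   jps | missing_daemons NameNode DataNode SecondaryNameNode ResourceManager NodeManager
```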

Step 6.7: Once start-all.sh completes successfully, check the following URLs in a browser:
http://<instance-public-ip>:8088

http://<instance-public-ip>:50070

http://<instance-public-ip>:50090
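The three URLs differ only in the port, so they can be generated for any instance address. This is a small sketch with a hypothetical function name, hadoop_ui_urls. In Hadoop 2.x, these ports are the defaults for the YARN ResourceManager (8088), the NameNode (50070), and the SecondaryNameNode (50090) web interfaces.

```shell
# Hypothetical helper: print the Hadoop web UI URLs for a given address.
# Usage: hadoop_ui_urls <instance-public-ip>
hadoop_ui_urls() {
    for port in 8088 50070 50090; do
        printf 'http://%s:%s\n' "$1" "$port"
    done
}

# Example: probe each page with curl and print the HTTP status next to it
# (assumes curl is installed and the ports are open in the security group):
#   hadoop_ui_urls 54.237.233.225 | while read -r url; do
#       curl -s -o /dev/null -w "%{http_code} $url\n" "$url"
#   done
```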

Step 7: Start HBase
$ cd /usr/local/hbase/bin
$ ./start-hbase.sh

Step 8: Start R console
$ R


Step 9: Start RStudio Server
$ cd ~
$ sudo gdebi rstudio-server-0.98.1028-amd64.deb

$ sudo rstudio-server start

Step 10: Set the password for the rstudio user
$ sudo passwd rstudio

Step 10.1: Configure r-hadoop
Open RStudio in a browser:
http://<instance-public-ip>:8787/
Example:
http://54.237.233.225:8787/
Log in as the rstudio user with the newly set password.

After login, the screen looks like this:

To install the r-hadoop packages:
Select Tools->Install Packages, then set "Install from" to Package Archive File (.tar.gz)


Click the Browse button.
In the file explorer window that opens, select the available packages for the r-hadoop service. The available packages are:
rhdfs_1.0.8.tar.gz
rhbase_1.2.1.tar.gz
plyrmr_0.6.0.tar.gz
ravro_1.0.4.tar.gz
rmr2_3.3.1.tar.gz
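The same archives can also be installed from the shell with R CMD INSTALL, which installs a source package from a .tar.gz file. The sketch below is illustrative only: the function name install_r_packages and the DRY_RUN variable are hypothetical, and the install order in the example is an assumption, since some packages may need others to be installed first. By default it only prints the commands it would run.

```shell
# Hypothetical helper: install the named package archives from a directory,
# in the order given. With DRY_RUN unset or set to 1 the commands are only
# printed; set DRY_RUN=0 to actually run R CMD INSTALL.
# Usage: install_r_packages <dir> <archive>...
install_r_packages() {
    dir=$1
    shift
    for pkg in "$@"; do
        if [ ! -f "$dir/$pkg" ]; then
            echo "missing: $dir/$pkg" >&2
            return 1
        fi
        if [ "${DRY_RUN:-1}" = "1" ]; then
            echo "R CMD INSTALL $dir/$pkg"
        else
            R CMD INSTALL "$dir/$pkg" || return 1
        fi
    done
}

# Example (assumed order; adjust if a package reports a missing dependency):
#   install_r_packages "$HOME" rmr2_3.3.1.tar.gz rhdfs_1.0.8.tar.gz \
#       rhbase_1.2.1.tar.gz ravro_1.0.4.tar.gz plyrmr_0.6.0.tar.gz
```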

After that, the R Hadoop Predictive Analytics Framework environment is ready for use, and you can run your own commands.