What is R4ML?

R4ML is a scalable, hybrid approach to ML/Stats using R, Apache SystemML, and Apache Spark

R4ML Key Features

R4ML is a git downloadable open source R package from IBM
Created on top of SparkR and Apache SystemML (so it supports features from both)
Acts as a R bridge between SparkR and Apache SystemML
Provides a collection of canned algorithms
Provides the ability to create custom ML algorithms
Provides both SparkR and Apache SystemML functionality
APIs are friendlier to the R user

R4ML Architecture

How to install

Quick install (run from R console):

# Download Apache Spark 2.1.0 (Note: Java must be installed)
download.file("http://archive.apache.org/dist/spark/spark-2.1.0/spark-2.1.0-bin-hadoop2.7.tgz", "~/spark-2.1.0-bin-hadoop2.7.tgz")
system("tar -xvf ~/spark-2.1.0-bin-hadoop2.7.tgz")
Sys.setenv("SPARK_HOME" = file.path(getwd(), "spark-2.1.0-bin-hadoop2.7"))

# Add the library path for SparkR
.libPaths(c(.libPaths(), "~/spark-2.1.0-bin-hadoop2.7/R/lib/"))

# Install R4ML dependencies
install.packages(c("uuid", "R6"), repos = "http://cloud.r-project.org")

# Download and install R4ML
download.file("http://codait-r4ml.s3-api.us-geo.objectstorage.softlayer.net/R4ML_0.8.0.tar.gz", "~/R4ML_0.8.0.tar.gz")
install.packages("~/R4ML_0.8.0.tar.gz", repos = NULL, type = "source")

# Load dependencies and use R4ML
library("SparkR", lib.loc = "~/spark-2.1.0-bin-hadoop2.7/R/lib/")
library("R4ML")
r4ml.session(sparkHome = file.path(getwd(), "spark-2.1.0-bin-hadoop2.7"))

More detailed instructions can be found here.

How to Use R4ML

Once you have installed R4ML it is time to use it for scalable machine learning and data analysis. Look at the section on R4ML Examples.

R4ML Documentation

After you follow the instruction at 'How to install', you can point your browser to

$R4ML_INSTALLED_LOCATION/R4ML/html/00Index.html

For example, if you have installed in the /home/data-scientist/codait then open a web browser and type in the following in the url

file:///home/data-scientist/codait/R4ML/html/00Index.html

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
.github		.github
R4ML		R4ML
bin		bin
docs		docs
.gitattributes		.gitattributes
.gitignore		.gitignore
ContributionGuide.md		ContributionGuide.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is R4ML?

R4ML Key Features

R4ML Architecture

How to install

How to Use R4ML

R4ML Documentation

About

Releases

Packages

Contributors 13

Languages

License

CODAIT/r4ml

Folders and files

Latest commit

History

Repository files navigation

What is R4ML?

R4ML Key Features

R4ML Architecture

How to install

How to Use R4ML

R4ML Documentation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 13

Languages

Packages