1
/
of
1
BOOKZONE
Parallel R: Data Analysis in the Distributed World
Paperback
by Q. Ethan Mccallum
SKU:9789350236802
Regular price
₹616.25INR
Regular price
₹725.00INR
Sale price
₹616.25INR
Shipping calculated at checkout.
Bulk Discount Get Exta 5% upto 10%
Share

1
/
of
1
It’stough to argue with R as a high-quality, cross-platform, open sourcestatistical software product—unless you’re in the business of crunching BigData. This concise book introduces you to several strategies for using R toanalyze large datasets. You’ll learn the basics of Snow, Multicore, Parallel,and some Hadoop-related tools, including how to find them, how to use them,when they work well, and when they don’t.
Withthese packages, you can overcome R’s single-threaded nature by spreading workacross multiple CPUs, or offloading work to multiple machines to address R’smemory barrier.
-
Snow: works well in a traditional cluster environment
-
Multicore: popular for multiprocessor and multicore computers
-
Parallel: part of the upcoming R 2.14.0 release
-
R+Hadoop: provides low-level access to a popular form of cluster computing
-
RHIPE: uses Hadoop’s power with R’s language and interactive shell
-
Segue: lets you use Elastic MapReduce as a backend for lapply-style operations
