Categories
Paxcel Labs Big Data Beginners

Review of Supervised Learning Algorithms on Medical Data

To review the accuracy of supervised machine learning algorithms, we took a dataset named “prostate”, available in the “ElemStatLearn” package in R. “prostate” is a dataset to examine Read More[...]
Categories
BI Big Data

Using Multiple R servers for distributed processing

The purpose of this blog is to demonstrate how to use R servers in a distributed manner, i.e. how to access multiple R servers simultaneously for distributed processing and better performance. We Read More[...]
Categories
BI Big Data

Loading Large Data files in R

To start working with large data sets in R, the first question is how to load the data for further analysis. Our test data consisted of large CSV files containing matrices, which were needed for further Read More[...]
Categories
Paxcel Labs BI Big Data

Finding the optimum partition size while parallelizing large matrix operations in R

Problem Statement: Finding the optimum partition size while parallelizing large matrix operations in R. Description: How does R’s parallel system work? Before moving to analysis, we must know how Read More[...]
Categories
Java Big Data

Returning & Using Multiple Values from a HIVE UDF

One of the typical problems faced while implementing User Defined Functions (UDFs) in HIVE is how to return multiple values from a UDF, and how to use those values (columns) in the HIVE select statement. In Read More[...]
Categories
Paxcel Labs Microsoft Tech Big Data

HDInsight: Installation on Windows platform

A guide to getting started with HDInsight on Windows. Installing HDInsight on Windows: The HDInsight installer is powered by the Microsoft Web Platform Installer. To download it, you can use the following link: http://www.microsoft.com/web/gallery/install.aspx?appid=HDINSIGHT-PREVIEW After Read More[...]
Categories
Paxcel Labs Microsoft Tech Big Data

Microsoft’s Big Data Initiative with Apache Hadoop, HIVE and PIG

Big data: Nowadays the hottest topic of discussion in the technology world Read More[...]
Categories
Paxcel Labs Java Microsoft Tech BI Big Data

Expectations from Hadoop On Azure

Microsoft has launched Hadoop on its cloud platform, Azure. So I, along with Pushpinder Singh and Sukhjot Singh here at Paxcel Labs, analyzed what Hadoop on Azure offers to the market. Right now Microsoft has not launched Hadoop on Azure (HOA) for everyone, but it gives access to its test environments by invitation only. After getting access, we started exploring it. Read More[...]
Categories
Paxcel Labs Java Big Data

Hadoop installation on Linux – A Comprehensive Guide

Troubled by the myriad errors you get while installing Hadoop on Linux? This is a comprehensive guide to Hadoop installation on Linux that will make your installation a smooth process. Read More[...]