Tag Archives: hadoop

Eclipse Map and Reduce Plugin & Hadoop Tutorial

Questions: I’m brand new to Hadoop and I’m following this Yahoo Tutorial (http://developer.yahoo.com/hadoop/tutorial/). I’m currently trying to configure Eclipse and the Map and Reduce plugin to connect to the virtual machine. One of the settings I need to configure is hadoop.job.ugi, but it does not appear under the Advanced Settings tab of the plugin. Without…

Job failed Exception hadoop

Questions: I am using MultipleTextOutputFormat to split a single file into multiple files, i.e. each line goes to a new file. This is my code: public class MOFExample extends Configured implements Tool { private static double count = 0; static class KeyBasedMultipleTextOutputFormat extends MultipleTextOutputFormat<Text, Text> { @Override protected String generateFileNameForKeyValue(Text key, Text value, String…

Hadoop Documentation for Eclipse

Questions: I recently installed Hadoop and am able to run simple programs. However, I would like to view the documentation for Hadoop classes in the Javadoc browser in Eclipse. Please let me know how to enable that (I am something of a novice with the Eclipse IDE). Thanks. Answers: A couple of suggestions: If you’re using Maven for your…

How to write a Dataset to an Excel file using the Hadoop Office library in Apache Spark (Java)

Questions: Currently I am using com.crealytics.spark.excel to read an Excel file, but with this library I can’t write the dataset to an Excel file. This link says that with the Hadoop Office library (org.zuinnote.spark.office.excel) we can both read and write Excel files. Please help me write a Dataset object to an Excel file in Spark Java. Answers:…

InstantiationException in Hadoop MapReduce program

Questions: I am new to the Hadoop framework. I was trying to write a program that reads an XML file from HDFS, parses it using JDOM and sends it to a database. The following is the Java file: package JDOMprs; import java.io.IOException; import java.util.ArrayList; import java.util.List; import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.conf.Configured; import org.apache.hadoop.fs.Path; import org.apache.hadoop.io.DoubleWritable; import org.apache.hadoop.io.LongWritable; import…

Apache Nutch 1.9 on Hadoop 1.2.1 no Crawl class in jar file

Questions: I’m running a cluster of five Cubieboards (Raspberry Pi-like ARM boards) with Hadoop 1.2.1 installed on them because they are 32-bit. There is one name node and four slave nodes. For my final paper I wanted to install Apache Nutch 1.9 and Solr for big data analysis. I did the setup as explained here: http://wiki.apache.org/nutch/NutchHadoopTutorial#Deploy_Nutch_to_Multiple_Machines When…