Practice Test - Hadoop

1. What is the distributed cache?



2. You want to count the number of occurrences of each unique word in the supplied input data. You’ve decided to implement this by having your mapper tokenize each word and emit a literal value 1, and then have your reducer increment a counter for each literal 1 it receives. After successfully implementing this, it occurs to you that you could optimize it by specifying a combiner. Will you be able to reuse your existing reducer as your combiner in this case, and why or why not?
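
For reference, the setup described in this question can be sketched in plain Python, without Hadoop. This is only an illustrative simulation: the names `mapper`, `shuffle`, `reducer`, and `word_count` are hypothetical and are not Hadoop APIs. The reducer here counts the values it receives, as the question describes. The sketch deliberately leaves out the combiner.

```python
from collections import defaultdict


def mapper(line):
    """Tokenize a line and emit (word, 1) for every token."""
    for word in line.split():
        yield word, 1


def shuffle(pairs):
    """Group emitted values by key, standing in for the framework's shuffle step."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reducer(word, values):
    """Increment a counter once for each value received for this word."""
    count = 0
    for _ in values:
        count += 1
    return word, count


def word_count(lines):
    """Run the map, shuffle, and reduce phases in sequence on in-memory input."""
    pairs = [pair for line in lines for pair in mapper(line)]
    return dict(reducer(word, values) for word, values in shuffle(pairs).items())
```

Calling `word_count(["a b a", "b a"])` runs all three phases on two input lines.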



3. Which of the following MapReduce execution frameworks focus on execution in shared-memory environments?



4. Can you provide multiple input paths to a MapReduce job?



5. In a MapReduce job, the reducer receives all values associated with the same key. Which statement best describes the ordering of these values?



6. What is a map-side join?



7. What are map files and why are they important?



8. What is the input to the Reduce function?



9. Why would a developer create a MapReduce job without the reduce step?



10. How can a distributed filesystem such as HDFS provide opportunities for optimization of a MapReduce operation?


