1. Define Big Data?
Big Data is defined as the Unstructured or Semi Structured Data which cannot be stored in to Traditional RDBMS
2. What are the 4 critical features in Big Data?
The 4 Critical features or also called as 4 ‘V’s in Big Data are
Volume, Velocity, Variety and Veracity
3. Mention the List of Top companies that are using Hadoop?
Facebook, Yahoo, Amazon, eBay, Netflix, Adobe and Twitter are some of the Top companies that are using Hadoop
4. What are the Core components of Hadoop Frame works?
Hadoop Frame works work on two major and core components namely Hadoop Distributed File System(HDFS) and Hadoop MapReduce.
5. Mention the important components of Hadoop Application?
Hadoop Common,HDFS,Hadoop MapReduce,YARN are the important components of Hadoop Application.
6. Mention Different Data Access Components in Hadoop?
Pig and Hive are the Data Access Components in Hadoop.
7. Mention Different Data Storage and Data Integration Components in Hadoop?
HBase is a Data Storage Component and different Data Integration Components are Apache, Flume, Sqoop and Chukwa.
8. Mention Data Management and Monitoring Components in Hadoop?
Ambari,Oozie and Zookeper are Data Management and Monitoring components in Hadoop.
9. Mention Data Serialization and Intelligence Components in Hadoop?
Thrift and Avro are Data Serialization Components in Hadoop.The Data Intelligence components are Apache, Mahout and Drill
10. What do you mean by Hadoop Streaming?
For Writing Map and Reduce Jobs in any programming language Hadoop distribution has a generic application programming interface often referred as Hadoop Streaming .With the help of this Users can create and run jobs with all kinds of shell scripts or executable as Mapper or Reducers.
11. What is the best configuration for executing Hadoop Jobs?
Hadoop jobs works under dual core processors with 4GB RAM that uses ECC Memory which is recommendable to remove checksum errors and Hardware configuration depends on work flow requirements.
12. What are the most common input formats defined in Hadoop?
The Most common input formats defined in Hadoop are Text Input Format, Key value input format and Sequence file input format.
0 comments:
Post a Comment