1. Most organizations that use Hadoop also use a set of other related products that interact and complement each other to produce an entire ______ of applications and tools.
Ecosystem
2. You want to use inequalities, such as greater than, and logical operators in a query. Which clause allows this?
WHERE
3. ___ applications attempt to provide faster query access than is possible through MapReduce by interacting with HDFS directly instead of going through the MapReduce processing layer.
Direct query
4. By default, Hadoop uses a replication factor of ______.
three
5. You are a Hadoop developer and want to query the value component of a pair in a key-value database. You should use get or ______.
fetch
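The get/fetch operation in question 5 can be illustrated with a minimal in-memory sketch. The `KeyValueStore` class and its method names are hypothetical, not the API of any particular product; the point is only that the store treats the value as opaque and retrieves it by its key.

```python
class KeyValueStore:
    """Toy key-value store: values are looked up only by key."""

    def __init__(self):
        self._data = {}

    def store(self, key, value):
        # Place (or overwrite) the value held under this key.
        self._data[key] = value

    def get(self, key):
        # Return the value component for the key, or None if the key is absent.
        return self._data.get(key)

    # Some products call the same operation "fetch".
    fetch = get

    def delete(self, key):
        # Remove the key-value pair if it exists.
        self._data.pop(key, None)


kv = KeyValueStore()
kv.store("member:1", {"fname": "Phillip", "lname": "Stallings"})
value = kv.fetch("member:1")
```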
6. In the context of Big Data, ______ refers to changes in the meaning of the data.
variability
7. ______ was the first SQL-on-Hadoop application.
Impala
8. The nodes that store the actual file data within the HDFS are called __.
Data nodes
9. As a database administrator, which database would you use in a scenario where you want to model and store data about relationship-rich environments?
graph
10. What is the name of the first property given in the command below?
Command: CREATE (:Member {mid: 1, fname: "Phillip", lname: "Stallings"})
mid
11. In HDFS, the signal sent every 3 seconds from the data node to the name node to notify the name node that the data node is still available is called a ___.
Heartbeat
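The heartbeat mechanism can be sketched as a name node that records the last time each data node checked in. The 3-second interval comes from the question; the `NameNode` class, its methods, and the 30-second timeout are illustrative assumptions for this sketch, not HDFS code or HDFS defaults.

```python
HEARTBEAT_INTERVAL_S = 3   # from the question: one heartbeat every 3 seconds
DEAD_AFTER_S = 30          # assumed timeout, chosen for this sketch only


class NameNode:
    """Toy name node that tracks data node liveness from heartbeats."""

    def __init__(self):
        self._last_seen = {}  # data node id -> time of last heartbeat (seconds)

    def receive_heartbeat(self, node_id, now):
        # A data node reports that it is still available.
        self._last_seen[node_id] = now

    def live_nodes(self, now):
        # Nodes whose last heartbeat is recent enough are considered available.
        return sorted(
            node for node, seen in self._last_seen.items()
            if now - seen <= DEAD_AFTER_S
        )


nn = NameNode()
nn.receive_heartbeat("dn1", now=0)
nn.receive_heartbeat("dn2", now=0)
# dn1 keeps sending heartbeats every 3 seconds; dn2 goes silent.
for t in range(HEARTBEAT_INTERVAL_S, 61, HEARTBEAT_INTERVAL_S):
    nn.receive_heartbeat("dn1", now=t)
alive = nn.live_nodes(now=60)
```

After 60 seconds only `dn1` is still treated as available, because `dn2` has missed every heartbeat since time 0.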
12. Graph databases are heavily reliant on ___ among their data.
Interdependence
13. Neo4j is a ______ database.
graph
14. You are a Hadoop administrator teaching new hires a refresher course on the Hadoop framework. You explain that the two most important components of Hadoop are the Hadoop Distributed File System (HDFS) and ___.
MapReduce
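As a rough illustration of the MapReduce idea from question 14, the sketch below counts words in plain Python: a map step emits key-value pairs, the pairs are grouped by key, and a reduce step summarizes each group. It runs in a single process and does not use the Hadoop API; all function names are hypothetical.

```python
from collections import defaultdict


def map_words(line):
    # Map: emit a (word, 1) pair for every word in one line of input.
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    # Group all emitted values by key, as a framework would between phases.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    # Reduce: collapse the values for one key into a single result.
    return key, sum(values)


lines = ["big data big clusters", "data nodes store data"]
pairs = [pair for line in lines for pair in map_words(line)]
counts = dict(reduce_counts(k, v) for k, v in shuffle(pairs).items())
```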
15. Within Hadoop, _________ can transfer data in both directions, into and out of HDFS.
Sqoop
16. What is the label name in the command below?
Command: CREATE (:Member {mid: 1, fname: "Phillip", lname: "Stallings"})
Member
17. You want to store data about a node. This data works like the node's attributes. What do you want to store?
Properties
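To make the label and property vocabulary concrete, the node created by the CREATE command in the earlier questions can be pictured as a label plus a set of key-value properties. The `Node` dataclass below is a hypothetical model for illustration, not a description of how Neo4j stores data.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Toy model of a graph node: one label plus attribute-like properties."""
    label: str
    properties: dict = field(default_factory=dict)


# Mirrors: CREATE (:Member {mid: 1, fname: "Phillip", lname: "Stallings"})
member = Node("Member", {"mid": 1, "fname": "Phillip", "lname": "Stallings"})
property_names = list(member.properties)
```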
18. ___ is a low-level distributed file processing system.
HDFS
19. A report sent every 6 hours by the data node to the name node to inform the name node of the blocks on that data node is called ___.
Block report
20. As part of a big data development team, you want your team to apply the central control program used to accept, distribute, monitor, and report on MapReduce processing jobs in a Hadoop environment. What central control program would you suggest?
Job tracker
