MIS 399 Chapter 7

  1. A newly popular unit of data in the Big Data era is the petabyte (PB), which is  10^15 bytes
  2. All of the following statements about MapReduce are true EXCEPT  MapReduce runs without fault tolerance.
  3. Allowing Big Data to be processed in memory and distributed across a dedicated set of nodes can solve complex problems in near-real time with highly accurate insights. What is this process called?  in-memory analytics
  4. Big Data is being driven by the exponential growth, availability, and use of information.  True
  5. Big Data simplifies data governance issues, especially for global firms.   False
  6. Big Data uses commodity hardware, which is expensive, specialized hardware that is custom built for a client or application.  False
  7. Companies with the largest revenues from Big Data tend to be  the largest computer and IT services firms.
  8. Current total storage capacity lags behind the digital information being generated in the world.  True
  9. Data flows can be highly inconsistent, with periodic peaks, making data loads hard to manage. What is this feature of Big Data called?   Variability
  10. Despite their potential, many current NoSQL tools lack mature management and monitoring tools.  True
  11. For low latency, interactive reports, a data warehouse is preferable to Hadoop.  True
  12. Hadoop and MapReduce require each other to work.  False
  13. Hadoop was designed to handle petabytes and exabytes of data distributed over multiple nodes in parallel.  True
  14. How does Hadoop work?  It breaks up Big Data into multiple parts so each part can be processed and analyzed at the same time on multiple computers.
  15. If you have many flexible programming languages running in parallel, Hadoop is preferable to a data warehouse.  True
  16. In a Hadoop “stack,” what is a slave node?  a node where data is stored and processed
  17. In a Hadoop “stack,” what node periodically replicates and stores data from the Name Node should it fail?  secondary node
  18. In a network analysis, what connects nodes?  Edges
  19. In Application Case 7.6, Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse, it was found that urban individuals have a higher number of diagnosed disease conditions.  True
  20. In most cases, Hadoop is used to replace data warehouses.   False
  21. In the Alternative Data for Market Analysis or Forecasts case study, satellite data was NOT used for   monitoring individual customer patterns
  22. In the Analyzing Disease Patterns from an Electronic Medical Records Data Warehouse case study, what was the analytic goal?  determine differences in rates of disease in urban and rural populations
  23. In the financial services industry, Big Data can be used to improve   both A & B.
  24. In the opening vignette, Access Telecom (AT) built a system to better visualize customers who were unhappy before they canceled their service.  True
  25. In the Salesforce case study, streaming data is used to identify services that customers use most.  False
  26. In the Twitter case study, how did influential users support their tweets?  objective data
  27. It is important for Big Data and self-service business intelligence to go hand in hand to get maximum value from analytics.  True
  28. MapReduce can be easily understood by skilled programmers due to its procedural nature.  True
  29. Satellite data can be used to evaluate the activity at retail locations as a source of alternative data.  True
  30. Social media mentions can be used to chart and predict flu outbreaks.  True
  31. The quality and objectivity of information disseminated by influential users of Twitter is higher than that disseminated by noninfluential users.  True
  32. The term “Big Data” is relative as it depends on the size of the using organization.  True
  33. There is a clear difference between the type of information support provided by influential users versus the others on Twitter.  True
  34. Traditional data warehouses have not been able to keep up with   the variety and complexity of data.
  35. Under which of the following requirements would it be more appropriate to use Hadoop over a data warehouse?  unrestricted, ungoverned sandbox explorations
  36. Using data to understand customers/clients and business operations to sustain and foster growth and profitability is  an increasingly challenging task for today’s enterprises.
  37. What is Big Data’s relationship to the cloud?  Amazon and Google have working Hadoop cloud offerings.
  38. What is the Hadoop Distributed File System (HDFS) designed to handle?  unstructured and semistructured non-relational data
  39. Which Big Data approach promotes efficiency, lower cost, and better performance by processing jobs in a shared, centrally managed pool of IT resources?   grid computing
  40. Which of the following sources is likely to produce Big Data the fastest?  RFID tags
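
Study aid for items 2, 14, and 16: the sketch below imitates the split / map / reduce flow those answers describe, using word counting as the job. It is not Hadoop code. The function names (split_into_parts, map_part, shuffle, reduce_groups, word_count) are made up for illustration, and threads on one machine stand in for the separate nodes a real cluster would use. Fault tolerance, which real MapReduce provides, is left out.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def split_into_parts(records, n_parts):
    """Split the input into roughly equal parts, one per simulated node."""
    return [records[i::n_parts] for i in range(n_parts)]


def map_part(part):
    """Map step: emit a (word, 1) pair for every word in this part."""
    return [(word.lower(), 1) for line in part for word in line.split()]


def shuffle(mapped_parts):
    """Group the emitted values by key so each word's counts sit together."""
    groups = defaultdict(list)
    for pairs in mapped_parts:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_groups(groups):
    """Reduce step: collapse each word's list of 1s into a total."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(records, n_parts=3):
    """Run the map step on every part concurrently, then shuffle and reduce."""
    parts = split_into_parts(records, n_parts)
    # Assumption: worker threads play the role of worker (slave) nodes here.
    with ThreadPoolExecutor(max_workers=n_parts) as pool:
        mapped = list(pool.map(map_part, parts))
    return reduce_groups(shuffle(mapped))


lines = ["big data is big", "hadoop splits big data", "data nodes store data"]
counts = word_count(lines)
print(counts["big"], counts["data"])
```

Each part is mapped independently of the others, which is why the work can be spread over many computers; only the shuffle and reduce steps need to see results from every part.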
