- Bins can be created for which type of field? Continuous Measure
- Caching is optimizing the technique True
- Can we add or setup new string computation after SparkContext starts No
- Can we edit the data of RDD, for example, the case conversion? No
- Can you combine the libraries of Apache Spark into the same Application, for example, MLlib, GraphX, SQL and DataFrames etc. Yes
- Does Spark R make use of MLlib in any aspect? Yes
- Dstream internally is Continuous Stream of RDD
- How many Spark Context can be active per JVM? Only one
- How many tasks does Spark run on each partition? One
- How much faster can Apache Spark potentially run batch-processing programs when processed in memory than MapReduce can? 100 times faster
- If you have bar chart with a single bar, but you want to subdivide that bar into multiple categories, what should you add to your view? A dimension
- If you want to replace one Field with another without breaking your worksheet Use the Replace References feature
- In a Calculated Field, the following will be interpreted as a comment Anything after // will be treated as a comment
- In aggregate function can we get the data type different from as that input data type? Yes
- In Dataframe in Spark Once the domain object is converted into a data frame, the regeneration of domain object is not possible. True
- In Spark SQL optimization which of the following is not present in the logical plan Abstract syntax tree
- In the Physical planning phase of Query optimization we can use both Coast-based and Rule-based optimization True
- In the Southern Region Superstore data, the Product Category, Product Subcategory, and Product Name might be combined using which of the following: Hierarchy
- In which of the following Action the result is not returned to the driver. foreach()
- Is Spark included in every major distribution of Hadoop? Yes
- Spark is developed in which language Scala
- Spark SQL plays the main role in the optimization of queries. True
- SparkContext guides how to access the Spark cluster. True
- SparkSQL translates commands into codes. These codes are processed by Executor Nodes
- The basic abstraction of Spark Streaming is Dstream
- The default storage level of cache() is? MEMORY_ONLY
- The most important items on a dashboard should be positioned where? Upper Left
- The primary Machine Learning API for Spark is now the _____ based API DataFrame
- This optimizer is based on functional programming construct in Scala
- This type of join will include all values from the left table: Left
- We can extend the functionality of Tableau through Third-Party companies True
- What are the parameters defined to specify window operation Window length, sliding interval
- What is a transformation in Spark RDD? Takes RDD as input and produces one or more RDD as output.
- What is the maximum number of Columns in a Table 16 columns
- When creating a union between tables, what will happen if the field names do not match? Fields in the union that do not have matching field names will contain null values.
- When using data source Fields in Calculated Fields, the Square brackets are optional True
- Which chart type will best visualize the relationship between two continuous measures? Scatter Plot
- Which feature will allow you to show more relevant or descriptive dimension values in your view than what the original data provides? Alias
- Which is not a component on the top of Spark Core? Spark RDD
- Which of the descriptions below describes a Tableau Story A contains a sequence of worksheets or dashboards that work together to convey information
- Which of the following algorithm is not present in MLlib? Tanimoto distance
- Which of the following fields will have AGG in front of it when you add it to the view? A calculated field with the formula SUM([Profit])
- Which of the following functions can be used on string to create a measure? COUNTD
- Which of the following is a module for Structured data processing? Spark SQL
- Which of the following is a table calculation? TOTAL
- Which of the following is action? CountByValue()
- Which of the following is false for Apache Spark? Spark is an open source framework which is written in Java
- Which of the following is not a component of Spark Ecosystem? Sqoop
- Which of the following is not a default property which can be set for a measure? Sort
- Which of the following is not a function of Spark Context in Apache Spark? Entry point to Spark SQL
- Which of the following is not a Spark SQL query execution phases? Execution
- Which of the following is not an action? map
- Which of the following is not output operation on DStream ReduceByKeyAndWindow
- Which of the following is not the feature of Spark? It is cost efficient
- Which of the following is not true for DataFrame? DataFrame in Apache Spark is behind RDD
- Which of the following is not true for Hadoop and Spark? Both have their own file system
- Which of the following is the best reason to create a saved data source as a .TDS file You want to save the default field properties such as number formats and sort order
- Which of the following is the entry point of Spark Application SparkContext
- Which of the following is the entry point of Spark SQL? SparkSession
- Which of the following is the fundamental data structure of Spark RDD
- Which of the following is the reason for Spark being Speedy than MapReduce? DAG execution engine and in-memory computation
- Which of the following is true about DataFrame? DataFrames provide a more user-friendly API than RDDs.
- Which of the following is true for Spark core? It is the kernel of Spark
- Which of the following is true for Spark SQL? It enables users to run SQL / HQL queries on the top of Spark.
- Which of the following provide the Spark Core’s fast scheduling capability to perform streaming analytics. Spark Streaming
- Which of the following should be used if, for security purposes, you’d like users to enter their database credentials in order to access the workbook: Live Connection
- Which of the following uses a primary and secondary data source? Blending
- You can add additional Color palettes to Tableau True
- You have a dashboard and you want to add a feature so that selecting an element on one of the worksheets on the dashboard causes a change on another worksheet in the same dashboard. What features will do this? (select all that apply) Filter Action.Highlight Action.
- You have a scatter plot showing sales on the x-axis and profit on the y-axis, with each dot representing a different product subcategory. You want to show how the points on the scatter plot vary based on Net Profit, a third continuous variable. Which would be most appropriate? (Select all that apply) Size.Color.
- You have SUM([Minutes of Delay]) in your view. If aggregate measures is unchecked, what will the view show instead of SUM([Minutes of Delay])? Minutes of Delay
- You start with a blank worksheet and add a continuous field to the row shelf. How will this new field change your view? It will add a vertical axis
Other Links:
See other websites for quiz:
Check on QUIZLET