What is the most likely reason for an HBase table to contain millions of columns?
A. Data is imported from a relational database table
B. Data is stored in the column qualifier
C. There are thousands of columns families
D. The column names are randomly generated
In a social network, what does it mean for a node to have a high degree but low betweenness?
A. The node is adjacent to a few nodes, each of each has high Page Ranks.
B. The node has the only edge connecting its community to the rest of the graph.
C. The node can be easily bypassed by communications taking other shorter paths.
D. The node acts as the hub of the graph.
What is NOT a category of a NoSQL data store?
A. Columnar
B. Document
C. Key/Value
D. Flat File
What is the maximum number of edges in an undirected graph of 10 nodes?
A. 45
B. 90
C. 100
D. 9
Consider dataset that resides in HDFS. Which tool natively provides the capability to run a Random Forests model against this data?
A. Mahout
B. Pig
C. Hive
D. HBase
How is the relative value of a node visualized in a sunburst?
A. Color
B. Area
C. Gradient
D. Position
What is a characteristic of the trigram language model?
A. Based on the second-order Markov process
B. Equivalent to trigram hidden Markov models
C. Uses smoothing to reduce the high dimensionality in text
D. Can be used for part-of -speech tagging
In multinomial logistic regression, what is used to calculate the probability of outcome occurring?
A. Logistic function applied to a linear combination of the input and outcome variables
B. Linear regression applied to a combination of input variables
C. Linear regression applied to a combination of input and outcome variables
D. Logistic function applied to a linear combination of the input variables
What best describes the meaning behind the phrase "Six Degrees of Separation'"?
A. Ability to use about six hops to reach any other node in an extremely large social network
B. Erdos number of all scholars having written papers with Paul Erdos
C. Maximum number of edges between nodes in a graph with a diameter of six
D. Typical distance between nodes that are connected by triadic closure
After a client submits a job request to the YARN ResourceManager, what happens next?
A. The scheduler allocates a container to run an ApplicationMaster
B. The ResourceManager allocates containers to run map and reduce tasks
C. The Resource Manager requests load data from the NodeManagers
D. The ApplicationManager starts an ApplicationMaster