AI Fundamentals Midterm Exam - Attempt Review
Question 1
Correct
Select one:
a. To measure the dissimilarity between two probability distributions
Question 2
Correct
Select one:
a. The step where the expectation of the latent variables is calculated
Question 3
Correct
What is the process of evaluating the performance of a trained perceptron on unseen data called?
Select one:
a. Pruning
b. Validation
c. Testing
d. Training
Question 4
Correct
The KL distance is always positive and is equal to zero only when the two probability distributions are:
Select one:
a. Identically distributed
b. Mutually exclusive
c. Independently distributed
d. Uniformly distributed
Question 5
Incorrect
What is the process of using data mining techniques to identify trends and make predictions called?
Select one:
a. Data mining
b. Data modeling
c. Data visualization
d. Data analysis
Question 6
Correct
Select one:
a. An algorithm that processes the training data one example at a time
Question 7
Incorrect
Select one:
a. All of the above
Question 8
Correct
Select one:
a. It requires the user to specify the number of clusters in advance
b. It is sensitive to the initial placement of centroids
Question 9
Correct
Can the least squares method be used for nonlinear data sets?
Select one:
a. It depends on the data set
b. Yes
c. It depends on the method used to transform the data set
d. No
Question 10
Incorrect
The ______________ linkage criterion is a popular choice for hierarchical clustering, which merges the two clusters that have the maximum distance between them.
Select one:
a. Single
b. Complete
c. Average
d. Centroid
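The linkage criteria named in these options differ only in how they score the distance between two clusters. The following is a minimal Python sketch, assuming Euclidean distance and two small hypothetical 2-D clusters; the function names and data are illustrative and not part of the exam. Note that complete linkage scores a pair of clusters by the maximum pairwise distance between their points, while the agglomerative procedure still merges, at each step, the pair of clusters with the smallest score.

```python
import math
from itertools import product


def single_linkage(a, b):
    # Smallest distance between any point in a and any point in b.
    return min(math.dist(p, q) for p, q in product(a, b))


def complete_linkage(a, b):
    # Largest distance between any point in a and any point in b.
    return max(math.dist(p, q) for p, q in product(a, b))


def average_linkage(a, b):
    # Mean of all pairwise distances between the two clusters.
    return sum(math.dist(p, q) for p, q in product(a, b)) / (len(a) * len(b))


def centroid(points):
    # Coordinate-wise mean of the points in one cluster.
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def centroid_linkage(a, b):
    # Distance between the two cluster centroids.
    return math.dist(centroid(a), centroid(b))


# Hypothetical clusters, chosen only for illustration.
cluster_a = [(0.0, 0.0), (0.0, 2.0)]
cluster_b = [(3.0, 0.0), (5.0, 0.0)]

for name, fn in [("single", single_linkage), ("complete", complete_linkage),
                 ("average", average_linkage), ("centroid", centroid_linkage)]:
    print(name, round(fn(cluster_a, cluster_b), 4))
```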
Question 11
Correct
Select one:
a. Shannon entropy
b. Mutual information
c. Joint entropy
d. Cross-entropy
Question 12
Correct
Select one:
a. It is unable to handle nonlinear relationships
b. It is prone to overfitting
c. It is unable to handle large datasets
d. It is slow to converge
Question 13
Incorrect
Select one:
a. All of the above
Question 14
Correct
What is the process of adjusting the weights of a perceptron based on the error calculated during validation called?
Select one:
a. Testing
b. Pruning
c. Training
d. Validation
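The weight adjustment this question describes is commonly implemented with the perceptron learning rule, in which each weight moves in proportion to the prediction error on one example. The following is a minimal Python sketch, assuming a step activation, a learning rate of 1, and the logical AND function as hypothetical training data; none of these choices come from the exam.

```python
def predict(weights, bias, x):
    # Step activation: output 1 when the weighted sum is positive.
    total = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if total > 0 else 0


def train_perceptron(samples, labels, epochs=50, lr=1):
    weights = [0] * len(samples[0])
    bias = 0
    for _ in range(epochs):
        for x, target in zip(samples, labels):
            # Error is the desired output minus the actual output.
            error = target - predict(weights, bias, x)
            # Adjust each weight in proportion to the error and its input.
            weights = [w + lr * error * xi for w, xi in zip(weights, x)]
            bias += lr * error
    return weights, bias


# Hypothetical, linearly separable data: logical AND.
samples = [(0, 0), (0, 1), (1, 0), (1, 1)]
labels = [0, 0, 0, 1]

weights, bias = train_perceptron(samples, labels)
predictions = [predict(weights, bias, x) for x in samples]
print(weights, bias, predictions)
```

When the error is zero the weights are left unchanged, so the updates stop once every training example is classified correctly.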
Question 15
Correct
How does the k-means algorithm determine which data points belong to which cluster?
Select one:
a. By computing the distance between data points and the centroid of each cluster
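The distance-based assignment in option a can be sketched as follows. This is a minimal Python illustration of one k-means iteration, assuming Euclidean distance and hypothetical points and starting centroids; the function names are illustrative only.

```python
import math


def assign_to_clusters(points, centroids):
    # Each point joins the cluster whose centroid is nearest.
    return [
        min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
        for p in points
    ]


def update_centroids(points, assignments, centroids):
    # Each centroid moves to the mean of the points assigned to it.
    updated = []
    for k, old in enumerate(centroids):
        members = [p for p, a in zip(points, assignments) if a == k]
        if not members:
            updated.append(old)  # keep an empty cluster's centroid in place
            continue
        updated.append(tuple(sum(c) / len(members) for c in zip(*members)))
    return updated


# Hypothetical data and initial centroids.
points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5)]
centroids = [(0.0, 0.0), (10.0, 10.0)]

assignments = assign_to_clusters(points, centroids)
new_centroids = update_centroids(points, assignments, centroids)
print(assignments, new_centroids)
```

A full k-means run would repeat these two steps until the assignments stop changing.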
Question 16
Correct
The KL distance between two discrete probability distributions P and Q is defined as:
Select one:
a. The sum of the products of the probabilities of each event in P and Q
b. The sum of the differences between the probabilities of each event in P and Q
c. The sum of the ratio of the probabilities of each event in P and Q
d. The sum of the logarithm of the ratio of the probabilities of each event in P and Q
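For reference, the standard definition weights each log-ratio by the probability under P: D_KL(P || Q) = sum over x of P(x) * log(P(x) / Q(x)). The following is a minimal Python sketch of that formula, assuming natural logarithms, Q(x) > 0 wherever P(x) > 0, and two hypothetical distributions chosen only for illustration. It also shows that the value is zero for identical distributions and that the "distance" is not symmetric.

```python
import math


def kl_divergence(p, q):
    # D_KL(P || Q) in nats; terms with P(x) == 0 contribute nothing.
    # Assumes Q(x) > 0 wherever P(x) > 0.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical distributions over three events.
p = [0.5, 0.3, 0.2]
q = [0.4, 0.4, 0.2]

print(kl_divergence(p, q))  # positive
print(kl_divergence(q, p))  # positive, but a different value
print(kl_divergence(p, p))  # zero for identical distributions
```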
Question 17
Correct
Select one:
a. The step where the expectation of the latent variables is calculated
b. The step where the likelihood of the model is maximized
Question 18
Correct
Select one:
a. None of the above
Question 19
Correct
Select one:
a. By selecting the set of clusters that minimize the within-cluster variance
b. By selecting the set of clusters that maximize the sum of squared errors
c. By selecting the set of clusters that minimize the sum of squared errors
Question 20
Correct
How is the line of best fit calculated using the least squares method?
Select one:
a. By minimizing the sum of the squares of the errors between the data points and the line of best fit
b. By minimizing the mean of the data set
c. By minimizing the sum of the absolute values of the errors between the data points and the line of best fit
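Minimizing the sum of squared errors for a straight line has a closed-form solution. The following is a minimal Python sketch, assuming a simple one-variable fit y = slope * x + intercept and hypothetical data; the function names are illustrative. It also checks that nudging the fitted parameters does not reduce the sum of squared errors.

```python
def fit_line(xs, ys):
    # Closed-form least squares fit for y = slope * x + intercept.
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def sum_squared_errors(xs, ys, slope, intercept):
    # The quantity that the least squares method minimizes.
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))


# Hypothetical data lying close to y = 2x.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.1, 5.9, 8.2, 9.8]

slope, intercept = fit_line(xs, ys)
best = sum_squared_errors(xs, ys, slope, intercept)
print(slope, intercept, best)
```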
Question 21
Incorrect
Select one:
a. They can learn from a limited amount of resources
Question 22
Incorrect
Select one:
a. Classification
d. Clustering
Question 23
Correct
Select one:
a. Data analysis
b. Data modeling
c. Data visualization
d. Data mining
Question 24
Correct
Select one:
a. It is a data point that is randomly chosen to be the initial center of a cluster
Question 25
Correct
Select one:
a. It is used to determine the input to the neural network
b. It is used to adjust the weights of the neural network based on the input and output
c. It is used to calculate the output of the neural network
Question 26
Incorrect
The KL distance can be used to measure the information lost when approximating one distribution with another. In this context, the distribution being approximated is known as the:
Select one:
a. Target distribution
b. Approximation distribution
c. Reference distribution
d. Base distribution
Question 27
Correct
How is the slope of the line of best fit calculated using the least squares method?
Select one:
a. By dividing the sum of the y values by the sum of the squares of the x values
b. By dividing the sum of the product of the x values and the y values by the sum of the x values
c. By dividing the sum of the product of the x values and the y values by the sum of the squares of the x values
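The ratio in option c, sum(x * y) / sum(x^2), is the least squares slope for a line forced through the origin; it coincides with the slope of the general line of best fit when the x and y values are first centered on their means. The following is a minimal Python sketch of that relationship, assuming hypothetical data and illustrative function names.

```python
def slope_through_origin(xs, ys):
    # Least squares slope for the model y = slope * x (no intercept).
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


def center(values):
    # Subtract the mean so the values are centered on zero.
    mean = sum(values) / len(values)
    return [v - mean for v in values]


# Hypothetical data.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.1, 5.9, 8.2, 9.8]

raw_slope = slope_through_origin(xs, ys)
centered_slope = slope_through_origin(center(xs), center(ys))
print(raw_slope, centered_slope)
```

The two values differ here because the uncentered data do not pass exactly through the origin.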
Question 28
Correct
Select one:
a. The likelihood of the model
Question 29
Correct
What is the advantage of using the Gaussian Naive Bayes classifier over other types of Naive Bayes classifiers?
Select one:
a. It is able to handle continuous features
b. It is more accurate
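Gaussian Naive Bayes handles continuous features by modelling each feature, within each class, as a normal distribution and combining the per-feature likelihoods with the class prior. The following is a minimal Python sketch under those assumptions; the data, the class labels, the small variance-smoothing constant, and the function names are all hypothetical.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(rows, labels):
    # Estimate a prior and per-feature (mean, variance) for each class.
    grouped = defaultdict(list)
    for row, label in zip(rows, labels):
        grouped[label].append(row)
    model = {}
    for label, members in grouped.items():
        prior = len(members) / len(rows)
        stats = []
        for column in zip(*members):
            mean = sum(column) / len(column)
            # Small constant avoids division by zero (an assumption of this sketch).
            var = sum((v - mean) ** 2 for v in column) / len(column) + 1e-9
            stats.append((mean, var))
        model[label] = (prior, stats)
    return model


def log_gaussian(x, mean, var):
    # Log of the normal density with the given mean and variance.
    return -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)


def classify(model, row):
    # Pick the class with the highest log prior plus summed log likelihoods.
    def log_posterior(label):
        prior, stats = model[label]
        return math.log(prior) + sum(
            log_gaussian(x, m, v) for x, (m, v) in zip(row, stats)
        )
    return max(model, key=log_posterior)


# Hypothetical continuous features for two classes.
rows = [(1.0, 2.1), (1.2, 1.9), (0.8, 2.0), (3.0, 4.1), (3.2, 3.9), (2.9, 4.0)]
labels = ["low", "low", "low", "high", "high", "high"]

model = fit_gaussian_nb(rows, labels)
print(classify(model, (1.1, 2.0)), classify(model, (3.1, 4.0)))
```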
Question 30
Incorrect
The ______________ linkage criterion is a popular choice for hierarchical clustering, which merges the two clusters that have the minimum distance between them.
Select one:
a. Centroid
b. Complete
c. Average
d. Single
Question 31
Correct
Select one:
a. Grouping customers into different segments based on their spending habits
Question 32
Incorrect
What is the process of calculating the error between the desired output and the actual output of a perceptron called?
Select one:
a. Pruning
b. Testing
c. Validation
d. Training
Question 33
Correct
In hierarchical clustering, the distance between clusters is typically measured using the ______________ criterion.
Select one:
a. Euclidean distance
b. Linkage criterion
c. Manhattan distance
d. Cosine similarity
Question 34
Correct
Select one:
a. That the features in the data are normally distributed
Question 35
Incorrect
Select one:
a. Linear regression
b. K-nearest neighbors
Question 36
Incorrect
Select one:
a. Filtering
b. Sampling
c. Normalizing
d. Cleaning
Question 37
Correct
Select one:
a. To solve systems of linear equations
Question 38
Correct
How does the Naive Bayes classifier calculate the probability of a data point belonging to a particular class?
Select one:
a. By using the least squares method
b. By using the gradient descent algorithm
Question 39
Incorrect
Select one:
a. To optimize the use of resources
Question 40
Correct
Select one:
a. Sampling
b. Filtering
c. Normalizing
d. Cleaning
Question 41
Correct
Select one:
a. Grouping customers into different segments based on their spending habits
b. Determining whether an email is spam or not
d. Predicting the stock price for the next day based on historical data
Question 42
Correct
In hierarchical clustering, the final clusters are represented using a ______________ diagram.
Select one:
a. Dendrogram
b. Line graph
c. Bar chart
d. Scatter plot
Question 43
Incorrect
What is the main advantage of using a directed acyclic graph (DAG) over other types of graphs?
Select one:
a. DAGs are more efficient for storing and processing data
Question 44
Incorrect
The ______________ linkage criterion is a popular choice for hierarchical clustering, which merges the two clusters based on the distance between their centroids.
Select one:
a. Complete
b. Single
c. Average
d. Centroid
Question 45
Correct
What is an example of a batch learning algorithm used for feature selection tasks?
Select one:
a. Recursive feature elimination
b. Variance threshold
c. Mutual information
Question 46
Correct
Select one:
a. K-means
d. Agglomerative clustering
Question 47
Correct
Select one:
a. Support vector machine
b. K-nearest neighbors
c. Decision tree
d. Linear regression
Question 48
Correct
Select one:
a. It may produce suboptimal results if the clusters are not spherical
Question 49
Correct
Select one:
a. Java 10
b. Java 8
c. Java 9
d. Java 7
Question 50
Correct
Select one:
a. Supervised learning involves predicting a value, while unsupervised learning involves clustering data
b. Supervised learning involves clustering data, while unsupervised learning involves predicting a value
c. Supervised learning involves labeled data, while unsupervised learning involves unlabeled data
d. Supervised learning involves predicting a continuous value, while unsupervised learning involves predicting a categorical value