Data Mining
Data set {brown, black, blue, green , red} is example of
Select one:
a. Continuous attribute
b. Ordinal attribute
c. Numeric attribute
d. Nominal attribute
Feedback
The correct answer is: Nominal attribute
Question 2
Which of the following activities is NOT a data mining task?
Select one:
a. Predicting the future stock price of a company using historical records
b. Monitoring and predicting failures in a hydropower plant
c. Extracting the frequencies of a sound wave
d. Monitoring the heart rate of a patient for abnormalities
Feedback
The correct answer is: Extracting the frequencies of a sound wave
Question 3
Question text
Data Visualization in mining cannot be done using
Select one:
a. Photos
b. Graphs
c. Charts
d. Information Graphics
Feedback
The correct answer is: Photos
Question 4
Which of the following is not a data pre-processing methods
Select one:
a. Data Visualization
b. Data Discretization
c. Data Cleaning
d. Data Reduction
Feedback
The correct answer is: Data Visualization
Question 5
Which of the following statement is not TRUE for a Tag Cloud
Select one:
a. Tag cloud is a visualization of statistics of user-generated tags
b. Tags may be listed alphabetically in a tag cloud
c. Tag cloud can be used for numeric data only
d. The importance of a tag is indicated by font size or color
Feedback
The correct answer is: Tag cloud can be used for numeric data only
Question 6
Dimensionality reduction reduces the data set size by removing _________
Select one:
a. composite attributes
b. derived attributes
c. relevant attributes
d. irrelevant attributes
Feedback
The correct answer is: irrelevant attributes
Question 7
The difference between supervised learning and unsupervised learning is given by
Select one:
a. unlike unsupervised learning, supervised learning needs labeled data
b. unlike unsupervised learning, supervised learning can be used to detect outliers
c. there is no difference
d. unlike supervised leaning, unsupervised learning can form new classes
Feedback
The correct answer is: unlike unsupervised learning, supervised learning needs labeled data
Question 8
Which of the following activities is a data mining task?
Select one:
a. Monitoring the heart rate of a patient for abnormalities
b. Extracting the frequencies of a sound wave
c. Predicting the outcomes of tossing a (fair) pair of dice
d. Dividing the customers of a company according to their profitability
Feedback
The correct answer is: Monitoring the heart rate of a patient for abnormalities
Question 9
Identify the example of sequence data
Select one:
a. weather forecast
b. data matrix
c. market basket data
d. genomic data
Feedback
The correct answer is: genomic data
Question 10
To detect fraudulent usage of credit cards, the following data mining task should be used
Select one:
a. Outlier analysis
b. prediction
c. association analysis
d. feature selection
Feedback
The correct answer is: Outlier analysis
Question 11
Which of the following is NOT example of ordinal attributes?
Select one:
a. Zip codes
b. Ordered numbers
c. Movie ratings
d. Military ranks
Feedback
The correct answer is: Zip codes
Question 12
Data scrubbing can be defined as
Select one:
a. Check field overloading
b. Delete redundant tuples
c. Use simple domain knowledge (e.g., postal code, spell-check) to detect errors and make corrections
d. Analyzing data to discover rules and relationship to detect violators
Feedback
The correct answer is: Use simple domain knowledge (e.g., postal code, spell-check) to detect errors and make corrections
Question 13
Which data mining task can be used for predicting wind velocities as a function of temperature, humidity, air pressure, etc.?
Select one:
a. Cluster Analysis
b. Regression
c. Clasification
d. Sequential pattern discovery
Feedback
The correct answer is: Regression
Question 14
In asymmetric attibute
Select one:
a. No value is considered important over other values
b. All values are equals
c. Only non-zero value is important
d. Range of values is important
Feedback
The correct answer is: Only non-zero value is important
Question 15
Which statement is not TRUE regarding a data mining task?
Select one:
a. Clustering is a descriptive data mining task
b. Classification is a predictive data mining task
c. Regression is a descriptive data mining task
d. Deviation detection is a predictive data mining task
Feedback
The correct answer is: Regression is a descriptive data mining task
Question 16
Identify the example of Nominal attribute
Select one:
a. Temperature
b. Salary
c. Mass
d. Gender
Feedback
The correct answer is: Gender
Question 17
Synonym for data mining is
Select one:
a. Data Warehouse
b. Knowledge discovery in database
c. Business intelligence
d. OLAP
Feedback
The correct answer is: Knowledge discovery in database
Question 18
Nominal and ordinal attributes can be collectively referred to as_________ attributes
Select one:
a. perfect
b. qualitative
c. consistent
d. optimized
Feedback
The correct answer is: qualitative
Question 19
Which of the following is not a data mining task?
Select one:
a. Feature Subset Detection
b. Association Rule Discovery
c. Regression
d. Sequential Pattern Discovery
Feedback
The correct answer is: Feature Subset Detection
Question 20
Which of the following is an Entity identification problem?
Select one:
a. One person with different email address
b. One person's name written in different way
c. Title for person
d. One person with multiple phone numbers
Feedback
The correct answer is: One person's name written in different way
Question 21
In Binning, we first sort data and partition into (equal-frequency) bins and then which of the following is not a valid step
Select one:
a. smooth by bin boundaries
b. smooth by bin median
c. smooth by bin means
d. smooth by bin values
Feedback
The correct answer is: smooth by bin values
Question 22
Incorrect or invalid data is known as _________
Select one:
a. Missing data
b. Outlier
c. Changing data
d. Noisy data
Feedback
The correct answer is: Noisy data
Question 23
The important characteristics of structured data are
Select one:
a. Sparsity, Resolution, Distribution, Tuples
b. Sparsity, Centroid, Distribution , Dimensionality
c. Resolution, Distribution, Dimensionality ,Objects
d. Dimensionality, Sparsity, Resolution, Distribution
Feedback
The correct answer is: Dimensionality, Sparsity, Resolution, Distribution
Question 24
Which of the following are descriptive data mining activities?
Select one:
a. Deviation detection
b. Classification
c. Clustering
d. Regression
Feedback
The correct answer is: Clustering
Question 25
In a data mining task where it is not clear what type of patterns could be interesting, the data mining system should
Select one:
a. allow interaction with the user to guide the mining process
b. perform both descriptive and predictive tasks
c. perform all possible data mining tasks
d. handle different granularities of data and patterns
Feedback
The correct answer is: allow interaction with the user to guide the mining process
Question 26
Correlation analysis is used for
Select one:
a. handling missing values
b. identifying redundant attributes
c. handling different data formats
d. eliminating noise
Feedback
The correct answer is: identifying redundant attributes
Question 27
The number of item sets of cardinality 4 from the items lists {A, B, C, D, E}
Select one:
a. 2
b. 10
c. 20
d. 5
Feedback
The correct answer is: 5
Question 28
Question text
Which of the following is NOT a data quality related issue?
Select one:
a. Missing values
b. Outlier records
c. Duplicate records
d. Attribute value range
Feedback
The correct answer is: Attribute value range
Question 29
Which of the following is not a Data discretization Method?
Select one:
a. Histogram analysis
b. Cluster Analysis
c. Data compression
d. Binning
Feedback
The correct answer is: Data compression
Question 30
Question text
Which of the following data mining task is known as Market Basket Analysis?
Select one:
a. Association Analysis
b. Regression
c. Clasification
d. Outlier Analysis
Feedback
The correct answer is: Association Analysis