Monday, 29 January 2018

                           Data Mining


Data set {brown, black, blue, green , red} is example of

Select one:
a. Continuous attribute 
b. Ordinal attribute
c. Numeric attribute
d. Nominal attribute
Feedback
The correct answer is: Nominal attribute

Question 2
Which of the following activities is NOT a data mining task?

Select one:
a. Predicting the future stock price of a company using historical records
b. Monitoring and predicting failures in a hydropower plant
c. Extracting the frequencies of a sound wave 
d. Monitoring the heart rate of a patient for abnormalities
Feedback
The correct answer is: Extracting the frequencies of a sound wave

Question 3
Question text
Data Visualization in mining cannot be done using
Select one:
a. Photos
b. Graphs
c. Charts
d. Information Graphics 
Feedback
The correct answer is: Photos

Question 4
Which of the following is not a data pre-processing methods
Select one:
a. Data Visualization
b. Data Discretization
c. Data Cleaning 
d. Data Reduction
Feedback
The correct answer is: Data Visualization

Question 5
Which of the following statement is not TRUE for a Tag Cloud
Select one:
a. Tag cloud is a visualization of statistics of user-generated tags 
b. Tags may be listed alphabetically in a tag cloud
c. Tag cloud can be used for numeric data only
d. The importance of a tag is indicated by font size or color
Feedback
The correct answer is: Tag cloud can be used for numeric data only

Question 6
Dimensionality reduction reduces the data set size by removing _________
Select one:
a. composite attributes
b. derived attributes
c. relevant attributes
d. irrelevant attributes 
Feedback
The correct answer is: irrelevant attributes

Question 7
The difference between supervised learning and unsupervised learning is given by
Select one:
a. unlike unsupervised learning, supervised learning needs labeled data 
b. unlike unsupervised learning, supervised learning can be used to detect outliers
c. there is no difference
d. unlike supervised leaning, unsupervised learning can form new classes

Feedback
The correct answer is: unlike unsupervised learning, supervised learning needs labeled data

Question 8
Which of the following activities is a data mining task?
Select one:
a. Monitoring the heart rate of a patient for abnormalities
b. Extracting the frequencies of a sound wave
c. Predicting the outcomes of tossing a (fair) pair of dice
d. Dividing the customers of a company according to their profitability 
Feedback
The correct answer is: Monitoring the heart rate of a patient for abnormalities

Question 9
Identify the example of sequence data
Select one:
a. weather forecast
b. data matrix 
c. market basket data
d. genomic data
Feedback
The correct answer is: genomic data

Question 10
To detect fraudulent usage of credit cards, the following data mining task should be used
Select one:
a. Outlier analysis 
b. prediction
c. association analysis
d. feature selection
Feedback
The correct answer is: Outlier analysis
Question 11
Which of the following is NOT example of ordinal attributes?
Select one:
a. Zip codes
b. Ordered numbers 
c. Movie ratings
d. Military ranks
Feedback
The correct answer is: Zip codes

Question 12
Data scrubbing can be defined as
Select one:
a. Check field overloading
b. Delete redundant tuples
c. Use simple domain knowledge (e.g., postal code, spell-check) to detect errors and make corrections 
d. Analyzing data to discover rules and relationship to detect violators

Feedback
The correct answer is: Use simple domain knowledge (e.g., postal code, spell-check) to detect errors and make corrections

Question 13
Which data mining task can be used for predicting wind velocities as a function of temperature, humidity, air pressure, etc.?

Select one:
a. Cluster Analysis
b. Regression 
c. Clasification
d. Sequential pattern discovery
Feedback
The correct answer is: Regression

Question 14
In asymmetric attibute
Select one:
a. No value is considered important over other values
b. All values are equals
c. Only non-zero value is important 
d. Range of values is important
Feedback
The correct answer is: Only non-zero value is important

Question 15
Which statement is not TRUE regarding a data mining task?
Select one:
a. Clustering is a descriptive data mining task
b. Classification is a predictive data mining task 
c. Regression is a descriptive data mining task
d. Deviation detection is a predictive data mining task
Feedback
The correct answer is: Regression is a descriptive data mining task

Question 16
Identify the example of Nominal attribute
Select one:
a. Temperature 
b. Salary
c. Mass
d. Gender
Feedback
The correct answer is: Gender

Question 17
Synonym for data mining is
Select one:
a. Data Warehouse 
b. Knowledge discovery in database
c. Business intelligence
d. OLAP
Feedback
The correct answer is: Knowledge discovery in database

Question 18
Nominal and ordinal attributes can be collectively referred to as_________ attributes
Select one:
a. perfect
b. qualitative 
c. consistent
d. optimized
Feedback
The correct answer is: qualitative

Question 19
Which of the following is not a data mining task?
Select one:
a. Feature Subset Detection 
b. Association Rule Discovery
c. Regression
d. Sequential Pattern Discovery
Feedback
The correct answer is: Feature Subset Detection

Question 20
Which of the following is an Entity identification problem?
Select one:
a. One person with different email address
b. One person's name written in different way 
c. Title for person
d. One person with multiple phone numbers
Feedback
The correct answer is: One person's name written in different way

Question 21
In Binning, we first sort data and partition into (equal-frequency) bins and then which of the following is not a valid step

Select one:
a. smooth by bin boundaries
b. smooth by bin median
c. smooth by bin means
d. smooth by bin values 
Feedback

The correct answer is: smooth by bin values

Question 22
Incorrect or invalid data is known as _________

Select one:
a. Missing data
b. Outlier
c. Changing data
d. Noisy data 
Feedback
The correct answer is: Noisy data

Question 23
The important characteristics of structured data are
Select one:
a. Sparsity, Resolution, Distribution, Tuples
b. Sparsity, Centroid, Distribution , Dimensionality
c. Resolution, Distribution, Dimensionality ,Objects
d. Dimensionality, Sparsity, Resolution, Distribution 
Feedback
The correct answer is: Dimensionality, Sparsity, Resolution, Distribution

Question 24
Which of the following are descriptive data mining activities?
Select one:
a. Deviation detection 
b. Classification
c. Clustering
d. Regression
Feedback

The correct answer is: Clustering

Question 25
In a data mining task where it is not clear what type of patterns could be interesting, the data mining system should

Select one:
a. allow interaction with the user to guide the mining process 
b. perform both descriptive and predictive tasks
c. perform all possible data mining tasks
d. handle different granularities of data and patterns
Feedback
The correct answer is: allow interaction with the user to guide the mining process

Question 26
Correlation analysis is used for

Select one:
a. handling missing values
b. identifying redundant attributes 
c. handling different data formats
d. eliminating noise
Feedback
The correct answer is: identifying redundant attributes

Question 27
The number of item sets of cardinality 4 from the items lists {A, B, C, D, E}

Select one:
a. 2
b. 10
c. 20 
d. 5
Feedback
The correct answer is: 5

Question 28
Question text
Which of the following is NOT a data quality related issue?
Select one:
a. Missing values
b. Outlier records
c. Duplicate records
d. Attribute value range 
Feedback
The correct answer is: Attribute value range

Question 29
Which of the following is not a Data discretization Method?
Select one:
a. Histogram analysis
b. Cluster Analysis
c. Data compression 
d. Binning
Feedback
The correct answer is: Data compression

Question 30
Question text
Which of the following data mining task is known as Market Basket Analysis?

Select one:
a. Association Analysis 
b. Regression
c. Clasification
d. Outlier Analysis
Feedback
The correct answer is: Association Analysis