MCQs on Data Mining

Solve Data Mining Multiple-Choice Questions to prepare better for GATE. If you wish to learn more about Data Mining and Data Mining MCQs, you can check notes, mock tests, and previous years’ question papers. Gauge the pattern of MCQs on Data Mining by solving the ones that we have compiled below for your practice:

Data Mining Multiple-Choice Questions

1. Which of these is correct about data mining?

a. It is a procedure in which knowledge is mined from data.

b. It involves processes like Data Transformation, Data Integration, Data Cleaning.

c. It is a procedure using which one can extract information out of huge sets of data.

d. All of the above

Answer: (d) All of the above

2. The number of categories of functions involved in Data Mining is:

a. 5

b. 4

c. 3

d. 2

Answer: (d) 2

3. The classification or mapping of a class using a predefined class or group is called:

a. Data Sub Structure

b. Data Set

c. Data Discrimination

d. Data Characterisation

Answer: (c) Data Discrimination

4. What is the analysis conducted for uncovering some interesting statistical correlations between various associated-attribute-value pairs called?

a. Mining of Clusters

b. Mining of Correlations

c. Mining of Association

d. None of the above

Answer: (b) Mining of Correlations

5. __________ deals with the data objects that don’t comply with the general model or behaviour of the available data:

a. Evolution Analysis

b. Outlier Analysis

c. Classification

d. Prediction

Answer: (b) Outlier Analysis
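
As a rough illustration of outlier analysis, the sketch below flags values that lie far from the mean of a small sample. The function name `zscore_outliers`, the threshold of 2.0, and the sample readings are assumptions made for this example; real systems often use more robust methods.

```python
from statistics import mean, pstdev


def zscore_outliers(values, threshold=2.0):
    """Return the values whose z-score magnitude exceeds the threshold."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        return []  # all values are identical, so nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


# Hypothetical sensor readings; 95 does not follow the general behaviour.
readings = [10, 12, 11, 13, 12, 11, 10, 95]
print(zscore_outliers(readings))  # [95]
```

A z-score threshold is only one possible rule; it is used here because it is short and needs nothing beyond the standard library.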

6. The issues of “Scalability and efficiency of the data mining algorithms” come under:

a. User Interaction and Mining Methodology Issues

b. Diverse Data Types Issues

c. Performance Issues

d. None of the above

Answer: (c) Performance Issues

7. In Data Warehousing, how many approaches are there for the integration of heterogeneous databases?

a. 5

b. 4

c. 3

d. 2

Answer: (d) 2

8. In Data Warehousing, which of these is the correct advantage of the Update-Driven Approach?

a. It provides high performance.

b. The data can be copied, processed, integrated, annotated, summarised and restructured in advance in the semantic data store.

c. Both of the above

d. None of the above

Answer: (c) Both of the above

9. The primary use of data cleaning is:

a. Removing the noisy data

b. Correction of the data inconsistencies

c. Transformations for correcting the wrong data

d. All of the above

Answer: (d) All of the above

10. The classification of the Data Mining System consists of:

a. Machine Learning

b. Information Science

c. Database Technology

d. All of the above

Answer: (d) All of the above

11. Out of the following, which one is the proper application of data mining?

a. Fraud Detection

b. Market Management and Analysis

c. Risk Management & Corporate Analysis

d. All of the above

Answer: (d) All of the above

12. The class under study in Data Characterization is known as:

a. Final Class

b. Target Class

c. Initial Class

d. Study Class

Answer: (b) Target Class

13. A sequence of patterns that occurs frequently is called a:

a. Frequent Subsequence

b. Frequent Substructure

c. Frequent Item Set

d. All of the above

Answer: (a) Frequent Subsequence
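
To make the idea of a frequent subsequence concrete, here is a minimal sketch that counts ordered pairs of items across several sequences and keeps the pairs that appear in at least `min_support` of them. The function name, the support threshold, and the shopping sessions are all hypothetical; full sequential-pattern miners handle longer patterns and far larger data.

```python
from collections import Counter
from itertools import combinations


def frequent_ordered_pairs(sequences, min_support):
    """Count ordered pairs (x before y) and keep those seen in enough sequences."""
    counts = Counter()
    for seq in sequences:
        # combinations() preserves input order, so (x, y) means x came before y.
        # set() ensures a pair is counted once per sequence.
        counts.update(set(combinations(seq, 2)))
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Hypothetical purchase sequences, one list per customer.
sessions = [
    ["pc", "camera", "memory card"],
    ["pc", "printer", "camera"],
    ["camera", "memory card"],
]
print(frequent_ordered_pairs(sessions, min_support=2))
```

With these sessions, the pairs "pc then camera" and "camera then memory card" each occur in two sequences, so both meet the threshold.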

14. __________ means the description and trends or model regularities for those objects whose behavior would change eventually over time.

a. Evolution Analysis

b. Outlier Analysis

c. Classification

d. Prediction

Answer: (a) Evolution Analysis

15. The issue of Pattern evaluation comes under which of these?

a. Performance Issues

b. Diverse Data Types Issues

c. User Interaction and Mining Methodology Issues

d. None of the above

Answer: (c) User Interaction and Mining Methodology Issues

16. The issue of “Handling complex and relational types of data” comes under:

a. User Interaction and Mining Methodology Issues

b. Diverse Data Types Issues

c. Performance Issues

d. None of the above

Answer: (b) Diverse Data Types Issues

17. In Data Warehousing, which of these is a valid disadvantage of the Query-Driven Approach?

a. The Query-Driven Approach is very inefficient and very expensive for frequent queries.

b. This approach is very expensive for those queries that need aggregations.

c. It requires complex processes of integration and filtering.

d. All of the above

Answer: (d) All of the above

18. The initial step in the process of knowledge discovery is:

a. Data Selection

b. Data Integration

c. Data Cleaning

d. Data Transformation

Answer: (c) Data Cleaning

19. Multiple data sources are combined in which step of Knowledge Discovery?

a. Data Transformation

b. Data Selection

c. Data Integration

d. Data Cleaning

Answer: (c) Data Integration
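
Questions 18 and 19 name data cleaning and data integration as early knowledge-discovery steps. The sketch below shows one simple way the two steps could look on toy records. The helper names `clean` and `integrate`, the field names, and the sample data are assumptions for illustration only.

```python
def clean(records):
    """Data cleaning: drop records that lack an id or contain a missing (None) value."""
    return [r for r in records
            if r.get("id") is not None and None not in r.values()]


def integrate(*sources):
    """Data integration: combine several sources into one record per id."""
    merged = {}
    for source in sources:
        for record in source:
            merged.setdefault(record["id"], {}).update(record)
    return [merged[key] for key in sorted(merged)]


# Two hypothetical sources describing the same customers.
billing = [{"id": 1, "amount": 20.0}, {"id": 2, "amount": None}]
profiles = [{"id": 1, "city": "Pune"}, {"id": 3, "city": "Delhi"}]

combined = integrate(clean(billing), clean(profiles))
print(combined)
```

Here cleaning removes the billing record with a missing amount, and integration then merges the remaining records from both sources by `id`.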

20. The full form of DMQL is:

a. Data Marts Query Language

b. DBMiner Query Language

c. Dataset Mining Query Language

d. Data Mining Query Language

Answer: (d) Data Mining Query Language

21. Which of the following is generally used in finding hidden structure and patterns in a given unlabelled data?

a. Supervised learning

b. Unsupervised learning

c. Reinforcement learning

d. None of the above

Answer: (b) Unsupervised learning

22. Which of the following refers to obtaining information from the unstructured textual data?

a. Information retrieval

b. Information access

c. Both (a) and (b)

d. Neither (a) nor (b)

Answer: (a) Information retrieval

23. The correct order in which the sub-processes of data mining execute is:

a. Infrastructure, Exploration, Analysis, Interpretation, Exploitation

b. Infrastructure, Exploration, Analysis, Exploitation, Interpretation

c. Exploration, Interpretation, Infrastructure, Analysis, Exploitation

d. Exploration, Infrastructure, Exploitation, Analysis, Interpretation

Answer: (a) Infrastructure, Exploration, Analysis, Interpretation, Exploitation

24. KDD stands for:

a. Knowledge Discovery in Databases

b. Knowledge Definition Data

c. Knowledge Data Discovery

d. Knowledge Data Definition

Answer: (a) Knowledge Discovery in Databases

25. Functions of Data Mining are:

a. Association and correlation analysis, classification

b. Prediction and characterization

c. Cluster analysis and evolution analysis

d. All of the above

Answer: (d) All of the above

26. Which of the statements about hierarchical clustering is incorrect?

a. Hierarchical clustering is mainly used for exploration

b. Hierarchical clustering is mainly used for prediction

c. Both (a) and (b)

d. Neither (a) nor (b)

Answer: (b) Hierarchical clustering is mainly used for prediction

27. Self-organising maps can be considered as __________

a. Unsupervised learning

b. Supervised learning

c. Reinforcement learning

d. None of the above

Answer: (a) Unsupervised learning

28. Which of the following statements is true about the classification?

a. It is a measure of accuracy

b. It is a subdivision of a set

c. It is the task of assigning a classification

d. None of the above

Answer: (b) It is a subdivision of a set

29. “Hybrid” is defined as:

a. Combining different types of method or information

b. Information base filled with the knowledge of an expert

c. The design of learning algorithms modelled on the theory of evolution

d. None of the above

Answer: (a) Combining different types of method or information

30. Which of the following is defined as the Euclidean distance measure?

a. Finding the solution for a problem simply by enumerating all possible solutions

b. A KDD process stage in which new data is added to the existing selection

c. Both (a) and (b)

d. Neither (a) nor (b)

Answer: (d) Neither (a) nor (b)
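
For reference, the Euclidean distance between two points is the straight-line distance between them: the square root of the sum of squared coordinate differences. The short sketch below computes it; the function name `euclidean_distance` and the sample points are chosen only for this example.

```python
import math


def euclidean_distance(p, q):
    """Straight-line distance between two points of equal dimension."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# A 3-4-5 right triangle: the distance from the origin to (3, 4) is 5.
print(euclidean_distance((0, 0), (3, 4)))  # 5.0
```

Clustering methods commonly rely on a distance of this kind to decide which objects are similar.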

31. _____________ can be considered as the correct application of data mining.

a. Fraud detection

b. Management and market analysis

c. Corporate Analysis & Risk management

d. All of the above

Answer: (d) All of the above

32. The class under study in data characterization is referred to as the _____________.

a. First class

b. Target class

c. Final class

d. All of the above

Answer: (b) Target class

33. _____________ refers to the sequence of patterns that occurs frequently.

a. Frequent sub-sequence

b. Frequent substitution

c. Both (a) and (b)

d. Neither (a) nor (b)

Answer: (a) Frequent sub-sequence

34. “Handling the relational and complex types of data” comes under the __________ category.

a. Diverse Data Type

b. Performance issues

c. Both (a) and (b)

d. Neither (a) nor (b)

Answer: (a) Diverse Data Type

35. ______________ is used as the first step in the knowledge discovery process.

a. Data selection

b. Data cleaning

c. Data transfer

d. None of the above

Answer: (b) Data cleaning

36. The knowledge discovery step in which several data sources are combined is _____________.

a. Data integration

b. Data selection

c. Data cleaning

d. None of the above

Answer: (a) Data integration

37. Data Independence is referred to as:

a. Programs are not dependent on the logical attributes of data

b. Programs are not dependent on the physical attributes of data

c. Both (a) and (b)

d. Neither (a) nor (b)

Answer: (c) Both (a) and (b)

38. Which of the following is generally used by the E-R model to represent weak entities?

a. Doubly outlined rectangle

b. Dotted rectangle

c. Both (a) and (b)

d. Neither (a) nor (b)

Answer: (a) Doubly outlined rectangle

39. _______________ must be considered before investing in data mining.

a. Functionality

b. Compatibility

c. Both (a) and (b)

d. None of the above

Answer: (c) Both (a) and (b)

