Data Mining MCQs

MCQs on Data Mining

Solve Data Mining Multiple-Choice Questions to prepare better for GATE. If you wish to learn more about Data Mining and Data Mining MCQs, you can check notes, mock tests, and previous years’ question papers. Gauge the pattern of MCQs on Data Mining by solving the ones that we have compiled below for your practice:

Data Mining Multiple-Choice Questions

1. Which of these is correct about data mining?

a. It is a procedure in which knowledge is mined from data.

b. It involves processes like Data Transformation, Data Integration, Data Cleaning.

c. It is a procedure using which one can extract information out of huge sets of data.

d. All of the above

Answer: (d) All of the above

2. The total categories of functions that are involved in Data Mining are:

a. 5

b. 4

c. 3

d. 2

Answer: (d) 2

3. The classification or mapping of a class using a predefined class or group is called:

a. Data Sub Structure

b. Data Set

c. Data Discrimination

d. Data Characterisation

Answer: (c) Data Discrimination

4. What is the analysis conducted for uncovering some interesting statistical correlations between various associated-attribute-value pairs called?

a. Mining of Clusters

b. Mining of Correlations

c. Mining of Association

d. None of the above

Answer: (b) Mining of Correlations

5. __________ are the data objects that don’t comply with the general model or behaviour of the available data:

a. Evolution Analysis

b. Outlier Analysis

c. Classification

d. Prediction

Answer: (b) Outlier Analysis

6. The issues of “Scalability and efficiency of the data mining algorithms” come under:

a. User Interaction and Mining Methodology Issues

b. Diverse Data Types Issues

c. Performance Issues

d. None of the above

Answer: (c) Performance Issues

7. In Data Warehousing, how many approaches are there for the integration of heterogeneous databases?

a. 5

b. 4

c. 3

d. 2

Answer: (d) 2

8. In Data Warehousing, which of these is the correct advantage of the Update-Driven Approach?

a. It provides high performance.

b. It can be processed, copied, annotated, integrated, restructured and summarised in advance in the semantic data store.

c. Both of the above

d. None of the above

Answer: (c) Both of the above

9. The primary use of data cleaning is:

a. Removing the noisy data

b. Correction of the data inconsistencies

c. Transformations for correcting the wrong data

d. All of the above

Answer: (d) All of the above

10. The classification of the Data Mining System consists of:

a. Machine Learning

b. Information Science

c. Database Technology

d. All of the above

Answer: (d) All of the above

11. Out of the following, which one is the proper application of data mining?

a. Fraud Detection

b. Market Management and Analysis

c. Risk Management & Corporate Analysis

d. All of the above

Answer: (d) All of the above

12. The class under study in Data Characterization is known as:

a. Final Class

b. Target Class

c. InitialClass

d. Study Class

Answer: (b) Target Class

13. ____________ is a sequence of patterns that frequently occur is called as:

a. Frequent Subsequence

b. Frequent Substructure

c. Frequent Item Set

d. All of the above

Answer: (a) Frequent Subsequence

14. __________ means the description and trends or model regularities for those objects whose behavior would change eventually over time.

a. Evolution Analysis

b. Outlier Analysis

c. Classification

d. Prediction

Answer: (a) Evolution Analysis

15. The issue of Pattern evaluation comes under which of these?

a. Performance Issues

b. Diverse Data Types Issues

c. User Interaction and Mining Methodology Issues

d. None of the above

Answer: (c) User Interaction and Mining Methodology Issues

16. The issue of “Handling complex and relational types of data” comes under:

a. User Interaction and Mining Methodology Issues

b. Diverse Data Types Issues

c. Performance Issues

d. None of the above

Answer: (b) Diverse Data Types Issues

17. In Data Warehousing, which of these is a valid disadvantage of the Query-Driven Approach?

a. Query Driven Approach is very expensive and very inefficient for frequent queries.

b. This approach is very expensive for those queries that need aggregations.

c. It requires complex processes of integration and filtering.

d. All of the above

Answer: (d) All of the above

18. The initial steps concerned in the process of knowledge discovery is:

a. Data Selection

b. Data Integration

c. Data Cleaning

d. Data Transformation

Answer: (c) Data Cleaning

19. Multiple numbers of data sources get combined in which step of the Knowledge Discovery?

a. Data Transformation

b. Data Selection

c. Data Integration

d. Data Cleaning

Answer: (c) Data Integration

20. The full form of DMQL is:

a. Data Marts Query Language

b. DBMiner Query Language

c. Dataset Mining Query Language

d. Data Mining Query Language

Answer: (d) Data Mining Query Language

  1. Which of the following is generally used in finding hidden structure and patterns in a given unlabelled data?
    1. Supervised learning
    2. Unsupervised learning
    3. Reinforcement learning
    4. None of the above

    Answer (b)

  2. Which of the following refers to obtaining information from the unstructured textual data?
    1. Information retrieval
    2. Information access
    3. Both (a) and (b)
    4. Neither (a) nor (b)

    Answer (a)

  3. The correct order in which all sub-processes of data mining executes is
    1. Infrastructure, Exploration, Analysis, Interpretation, Exploitation
    2. Infrastructure, Exploration, Analysis, Exploitation, Interpretation
    3. Exploration, Interpretation,Infrastructure, Analysis, Exploitation
    4. Exploration, Infrastructure, Exploitation, Analysis, Interpretation

    Answer (a)

  4. KDD stands for?
    1. Knowledge Discovery Database
    2. Knowledge Definition Data
    3. Knowledge Data Discovery
    4. Knowledge Data Definition

    Answer (a)

  5. Functions of Data Mining are
    1. Association and correctional analysis classification
    2. Prediction and characterization
    3. Cluster analysis and evolution analysis
    4. All of the above

    Answer (d)

  6. Which of the statements about hierarchical clustering is incorrect?
    1. The hierarchal clustering can mainly be used for the aim of exploration
    2. The hierarchal clustering can mainly be used for prediction
    3. Both (a) and (b)
    4. Neither (a) nor (b)

    Answer (a)

  7. The self-organising maps can be considered as __________
    1. Unsupervised learning
    2. Supervised learning
    3. Reinforcement learning
    4. None of the above

    Answer (b)

  8. Which of the following statements is true about the classification?
    1. It is a measure of accuracy
    2. It is a subdivision of a set
    3. It is the task of assigning a classification
    4. None of the above

    Answer (b)

  9. “Hybrid” is defined as
    1. Combining different types of method or information
    2. Information base filled with the knowledge of an expert
    3. The design of learning algorithms which are lined along the theory of evolution
    4. None of the above

    Answer (a)

  10. Which of the following is defined as the Euclidean distance measure?
    1. Finding the solution for a problem simply by summarising all possible solutions
    2. A KDD process stage in which new data is added to the existing selection
    3. Both (a) and (b)
    4. Neither (a) nor (b)

    Answer (a)

  11. _____________ can be considered as the correct application of data mining.
    1. Fraud detection
    2. Management and market analysis
    3. Corporate Analysis & Risk management
    4. All of the above

    Answer (d)

  12. _____________ is referred to as the Class study in data cauterization.
    1. First class
    2. Target class
    3. Final class
    4. All of the above

    Answer (b)

  13. _____________ refers to the sequence of patterns that occurs frequently.
    1. Frequent sub-sequence
    2. Frequent substitution
    3. Both (a) and (b)
    4. Neither (a) nor (b)

    Answer (a)

  14. “Handling the rational and complex types of data” comes under the __________ category.
    1. Diverse Data Type
    2. Performance issues
    3. Both (a) and (b)
    4. Neither (a) nor (b)

    Answer (a)

  15. ______________ is used as the first step in the knowledge discovery process.
    1. Data selection
    2. Data cleaning
    3. Data transfer
    4. None of the above

    Answer (b)

  16. The knowledge discovery process in which several data are combined _____________.
    1. Data integration
    2. Date selection
    3. Data cleaning
    4. None of the above

    Answer (a)

  17. Data Independence is referred to as
    1. Programs independent of the logical attributes
    2. Programs are dependent on the physical attributes of data
    3. Both (a) and (b)
    4. Neither (a) nor (b)

    Answer (c)

  18. ________________ generally used by the E-R model to represent the weak entities?
    1. Doubly outlined rectangle
    2. Dotted rectangle
    3. Both (a) and (b)
    4. Neither (a) nor (b)

    Answer (a)

  19. _______________ must be considered before investing in data mining.
    1. Functionality
    2. Compatibility
    3. Both (a) and (b)
    4. None of the above

    Answer (c)

Keep learning and stay tuned to get the latest updates on the GATE Exam along with GATE MCQs, GATE Eligibility Criteria, GATE Syllabus for CSE (Computer Science Engineering), GATE Notes for CSE, GATE CSE Question Paper, and more.

Leave a Comment

Your Mobile number and Email id will not be published.

*

*