Data Mining Methods Basics Quiz
1. Which of the following is not applicable to Data Mining?
Answer: Involves working with known information
2. The process of extracting valid, useful, previously unknown information from data and using it to make proactive, knowledge-driven business decisions is called
Answer: Data mining
3. What is another name for the Data Preparation stage of the Knowledge Discovery Process?
Answer: ETL
4. Which of the following roles is responsible for performing validation on analysis datasets?
Answer: Statisticians
5. Which of the following activities is performed as part of data preprocessing?
Answer: Detect Missing Values
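To illustrate the answer above, here is a minimal sketch of detecting missing values in a small table. The function name `find_missing`, the sample rows, and the choice to treat `None`, empty strings, and NaN as "missing" are all assumptions made for illustration.

```python
import math

def find_missing(rows):
    """Return (row_index, column) pairs whose value is considered missing."""
    missing = []
    for i, row in enumerate(rows):
        for col, value in row.items():
            # Assumption: None, "" and float NaN all count as missing.
            is_nan = isinstance(value, float) and math.isnan(value)
            if value is None or value == "" or is_nan:
                missing.append((i, col))
    return missing

# Hypothetical sample data.
rows = [
    {"age": 34, "city": "Pune"},
    {"age": None, "city": "Delhi"},
    {"age": 29, "city": ""},
]
missing_cells = find_missing(rows)
print(missing_cells)
```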
6. Which of the following modelling types should be used for labelled data?
Answer: Predictive Modelling
7. Noisy values are values that are valid for the dataset but are incorrectly recorded.
Answer: True
8. Which statistical technique deals with finding a structure in a collection of unlabeled data?
Answer: Clustering
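As a rough illustration of clustering, the sketch below runs a one-dimensional k-means on unlabeled numbers. K-means is only one of several clustering methods; the function name `kmeans_1d`, the sample points, and the starting centroids are assumptions chosen for this example.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Group 1-D points around the nearest centroid, then update the centroids."""
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            # Assign each point to the index of its closest centroid.
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Move each centroid to the mean of its cluster (keep it if the cluster is empty).
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters

# Hypothetical unlabeled data with two visible groups.
points = [1.0, 1.5, 2.0, 9.0, 10.0, 11.0]
final_centroids, final_clusters = kmeans_1d(points, centroids=[0.0, 5.0])
print(final_centroids, final_clusters)
```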
9. The probability of theft in an area is 0.03, with an expected loss of 20% or 30% of the owner's belongings with probabilities 0.55 and 0.45, respectively. An insurance policy from company A costs $150 per annum with 100% repayment. A policy from company B costs $100 per annum, and the first $500 of any loss has to be paid by the owner. Which data mining technique can be used to choose the policy?
Answer: Decision Tree
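The branches of such a decision tree can be compared by expected yearly cost. The sketch below does that arithmetic. The question does not state the total value of the belongings, so `PROPERTY_VALUE` is an assumed figure, and the result depends on it; the helper name `expected_cost` is also hypothetical.

```python
P_THEFT = 0.03
# (fraction of belongings lost, probability of that outcome given a theft)
LOSS_OUTCOMES = [(0.20, 0.55), (0.30, 0.45)]
PROPERTY_VALUE = 10_000  # ASSUMPTION: not given in the question

def expected_cost(premium, deductible):
    """Premium plus the expected amount the owner pays out of pocket per year."""
    expected_owner_share = 0.0
    for fraction, prob in LOSS_OUTCOMES:
        loss = fraction * PROPERTY_VALUE
        # The owner pays the loss up to the deductible; the insurer pays the rest.
        expected_owner_share += prob * min(loss, deductible)
    return premium + P_THEFT * expected_owner_share

cost_a = expected_cost(premium=150, deductible=0)        # policy A: full repayment
cost_b = expected_cost(premium=100, deductible=500)      # policy B: owner pays first $500
cost_none = expected_cost(premium=0, deductible=float("inf"))  # no insurance

print(cost_a, cost_b, cost_none)
```

Under this assumed value, each leaf of the tree carries one of these expected costs, and the branch with the lowest figure would be preferred by a purely expected-cost criterion.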
10. What is the type of learning where a function is inferred to describe hidden structure from unlabeled data?
Answer: Unsupervised Learning
11. Statistical technique used for investigating and modelling the relationship between two or more variables is:
Answer: Regression analysis
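For two variables, the relationship is often modelled with a least-squares line. The following is a minimal sketch of simple linear regression in plain Python; the function name `fit_line` and the sample data are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data generated from y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```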
12. If time is used as an independent variable in a simple linear regression analysis, which of the following assumptions could be violated?
Answer: Successive observations of the dependent variable are uncorrelated
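One common way to check this assumption is the Durbin-Watson statistic computed on the regression residuals: values near 2 suggest no first-order autocorrelation, values toward 0 suggest positive autocorrelation, and values toward 4 suggest negative autocorrelation. The sketch below is an illustration only; the residual series are made up.

```python
def durbin_watson(residuals):
    """Sum of squared successive differences divided by the sum of squared residuals."""
    numerator = sum(
        (residuals[t] - residuals[t - 1]) ** 2 for t in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator

# Hypothetical residuals that drift slowly over time (positively correlated).
trending = [1.0, 0.8, 0.5, 0.1, -0.3, -0.6, -0.9]
# Hypothetical residuals that flip sign every step (negatively correlated).
alternating = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]

dw_trending = durbin_watson(trending)
dw_alternating = durbin_watson(alternating)
print(dw_trending, dw_alternating)
```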
13. The machine learning task of inferring a function from labelled training data is known as
Answer: Supervised Learning
14. Which is the statistical technique used for investigating and modelling the relationship between two or more variables?
Answer: Regression analysis
15. Regression is typically carried out to develop a mathematical model of the process
Answer: True
16. Association rule mining is also known as _____________
Answer: Affinity analysis
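Association rules are usually scored by support and confidence. The sketch below computes both for a rule over a handful of made-up shopping baskets; the function names and the data are assumptions for illustration.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Support of antecedent and consequent together, divided by support of the antecedent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)

# Hypothetical market baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
rule_support = support(baskets, {"bread", "milk"})
rule_confidence = confidence(baskets, {"bread"}, {"milk"})  # rule: bread -> milk
print(rule_support, rule_confidence)
```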
17. Which data mining method groups together objects that are similar to each other and dissimilar to the other objects?
Answer: Clustering
18. Which of the following activities are performed as part of data preprocessing?
Answer: All the options
19. _________ are the values that mark the boundaries of the confidence interval.
Answer: Confidence limits
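As an illustration, the sketch below computes lower and upper confidence limits for a sample mean. It assumes a normal approximation (for small samples a t-distribution would usually be more appropriate); the function name `confidence_limits` and the sample values are made up.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

def confidence_limits(sample, level=0.95):
    """Return (lower, upper) limits for the mean using a normal approximation."""
    # z-value that leaves (1 - level) / 2 in each tail of the standard normal.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    margin = z * stdev(sample) / sqrt(len(sample))
    centre = mean(sample)
    return centre - margin, centre + margin

# Hypothetical measurements.
sample = [10, 12, 11, 13, 9, 11, 12, 10]
lower, upper = confidence_limits(sample)
print(lower, upper)
```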
20. The process of extracting valid, useful, previously unknown information from data to make proactive, knowledge-driven business decisions is called
Answer: Data mining
21. Simulations are carried out to develop a mathematical model of the process
Answer: False