Credit Risk, Classification, Churn Prediction & KNIME for Data Science Practice | MCQs

Q1. What is typically the biggest challenge in assessing credit risk?

  • A. High-income applicants
  • B. People with the best credit don't need loans, and those with the worst likely won’t repay
  • C. Bank staff being biased
  • D. Low interest rates
    Answer: B. People with the best credit don't need loans, and those with the worst likely won’t repay

Q2. Which of the following is considered when evaluating someone's credit history?

  • A. Tax returns
  • B. Loan type
  • C. Whether previous loans were paid on time
  • D. Age of the applicant
    Answer: C. Whether previous loans were paid on time

Q3. Why are middle-segment customers typically preferred by banks?

  • A. They demand higher interest
  • B. They are more likely to need loans and repay them
  • C. They have government backing
  • D. They always default
    Answer: B. They are more likely to need loans and repay them

Q4. What attribute does NOT typically influence credit risk?

  • A. Marital status
  • B. Annual income
  • C. Favorite color
  • D. Credit history
    Answer: C. Favorite color

Q5. Which of these is a common loan term parameter?

  • A. Type of currency
  • B. Duration of loan repayment
  • C. Loan repayment office
  • D. Weekly groceries
    Answer: B. Duration of loan repayment

Q6. Income of a loan applicant is an example of a:

  • A. Dependent variable
  • B. Noise variable
  • C. Independent variable
  • D. Categorical label
    Answer: C. Independent variable

Q7. A married couple applying for a home loan provides which kind of data?

  • A. Loan default record
  • B. Personal information
  • C. Output variable
  • D. Noise factor
    Answer: B. Personal information

Q8. Credit risk models are typically built using:

  • A. Surveys
  • B. Machine learning
  • C. Manual inspection
  • D. Stock market trends
    Answer: B. Machine learning

Q9. Successful credit risk modeling has led to:

  • A. Decrease in credit card usage
  • B. Proliferation of mortgages and credit cards
  • C. Higher default rates
  • D. Less bank automation
    Answer: B. Proliferation of mortgages and credit cards

Q10. A bank rejecting a loan because of poor credit score is a:

  • A. Random process
  • B. Classification decision
  • C. Regression decision
  • D. Time series output
    Answer: B. Classification decision

Q11. Credit risk classification is an example of:

  • A. Supervised learning
  • B. Unsupervised learning
  • C. Clustering
  • D. Reinforcement learning
    Answer: A. Supervised learning

Q12. Which of the following is the class label in credit risk classification?

  • A. Annual income
  • B. Home ownership
  • C. Loan default (Yes/No)
  • D. Marital status
    Answer: C. Loan default (Yes/No)

Q13. In a training set, what is meant by x?

  • A. Class label
  • B. Outcome
  • C. Attributes or input variables
  • D. Test case
    Answer: C. Attributes or input variables

Q14. What type of classifier technique uses similarity to known points?

  • A. Naïve Bayes
  • B. Decision Tree
  • C. Nearest-Neighbor
  • D. Logistic Regression
    Answer: C. Nearest-Neighbor

Q15. Which classifier method builds hierarchical tree-like structures?

  • A. Neural networks
  • B. K-means
  • C. Decision Trees
  • D. SVM
    Answer: C. Decision Trees

Q16. Which ensemble method uses multiple trees?

  • A. Naïve Bayes
  • B. Random Forest
  • C. KNN
  • D. SVM
    Answer: B. Random Forest

Q17. Why are decision trees easy to interpret?

  • A. They use linear equations
  • B. They model logic rules
  • C. They cluster data automatically
  • D. They need no training data
    Answer: B. They model logic rules

Q18. What is a necessary requirement for a decision tree model?

  • A. No class labels
  • B. Image data
  • C. Predefined discrete classes
  • D. Only numeric input
    Answer: C. Predefined discrete classes

Q19. Which of the following would most likely result in a loan default?

  • A. Income over $150K
  • B. Married and homeowner
  • C. No home, low income, single
  • D. Good credit history
    Answer: C. No home, low income, single

Q20. A model that learns from labeled examples is called:

  • A. Unsupervised
  • B. Reinforced
  • C. Supervised
  • D. Clustering
    Answer: C. Supervised

Q21. What does a decision tree split data based on?

  • A. Correlations
  • B. Random noise
  • C. Attribute values
  • D. Time
    Answer: C. Attribute values

Q22. In a decision tree, leaves represent:

  • A. Feature sets
  • B. Nodes
  • C. Final decisions
  • D. Regression equations
    Answer: C. Final decisions

Q23. Decision trees are useful because they:

  • A. Require no computation
  • B. Can represent rules
  • C. Always yield 100% accuracy
  • D. Are slower than neural nets
    Answer: B. Can represent rules

Q24. If two trees fit the same data, this is because:

  • A. Data is corrupted
  • B. Classification is not needed
  • C. Multiple rules can produce same results
  • D. Machine learning failed
    Answer: C. Multiple rules can produce same results

Q25. Classification assumes:

  • A. No training data
  • B. Continuous outcome variable
  • C. Predefined class labels
  • D. Supervised clustering
    Answer: C. Predefined class labels

Q26. A person who owns a home and earns >80K is:

  • A. Likely to default
  • B. Likely not to default
  • C. Indeterminate
  • D. Not a loan candidate
    Answer: B. Likely not to default

Q27. If a person is single, not a homeowner, and earns <80K, the decision tree would likely:

  • A. Approve the loan
  • B. Request more documents
  • C. Predict default
  • D. Ask for a co-signer
    Answer: C. Predict default

Q28. Marital status and home ownership are examples of:

  • A. Output labels
  • B. Class variables
  • C. Input attributes
  • D. Noise features
    Answer: C. Input attributes

Q29. What is used to test a decision tree after it is trained?

  • A. Training data
  • B. Evaluation metrics
  • C. Test set
  • D. Parameters
    Answer: C. Test set

Q30. What is the process of making predictions called?

  • A. Induction
  • B. Deduction
  • C. Regression
  • D. Forecasting
    Answer: B. Deduction

Q31. Which of the following is NOT typically used in a decision tree?

  • A. Root node
  • B. Leaf node
  • C. Branch
  • D. Circle loop
    Answer: D. Circle loop

Q32. A classification rule such as “If A and B, then Yes” is represented in:

  • A. Linear Regression
  • B. Naïve Bayes
  • C. Decision Tree
  • D. Clustering
    Answer: C. Decision Tree

Q33. The process of using training data to build a model is called:

  • A. Pruning
  • B. Testing
  • C. Training
  • D. Validating
    Answer: C. Training

Q34. Which classification technique is most sensitive to outliers?

  • A. Random Forest
  • B. Decision Trees
  • C. K-Nearest Neighbors
  • D. SVM
    Answer: C. K-Nearest Neighbors

Q35. Which of the following is true about overfitting?

  • A. The model generalizes well
  • B. It performs poorly on training data
  • C. It performs well on training but poorly on new data
  • D. It increases prediction accuracy
    Answer: C. It performs well on training but poorly on new data

Q36. Pruning in decision trees helps to:

  • A. Increase tree depth
  • B. Improve training accuracy
  • C. Reduce overfitting
  • D. Add more features
    Answer: C. Reduce overfitting

Q37. Which metric would you use to assess classification performance?

  • A. R-squared
  • B. Mean Absolute Error
  • C. Accuracy
  • D. Median
    Answer: C. Accuracy

Q38. In credit modeling, a confusion matrix helps analyze:

  • A. Decision speed
  • B. Loan tenure
  • C. Model performance
  • D. Customer feedback
    Answer: C. Model performance

Q39. A model that has high precision but low recall:

  • A. Returns many false positives
  • B. Misses many actual positives
  • C. Is ideal for all cases
  • D. Overfits always
    Answer: B. Misses many actual positives

Q40. Decision trees handle:

  • A. Only numeric data
  • B. Only categorical data
  • C. Both numeric and categorical data
  • D. Only binary output
    Answer: C. Both numeric and categorical data

Q51. Customer churn refers to:

  • A. Customers buying more
  • B. Customers canceling services or subscriptions
  • C. Returning customers
  • D. Product returns
    Answer: B. Customers canceling services or subscriptions

Q52. Churn is a key concern in:

  • A. Manufacturing
  • B. Telecom and banking
  • C. Construction
  • D. Government
    Answer: B. Telecom and banking

Q53. What kind of machine learning is churn prediction?

  • A. Unsupervised learning
  • B. Supervised learning
  • C. Reinforcement learning
  • D. Dimensionality reduction
    Answer: B. Supervised learning

Q54. In churn prediction, the class label is usually:

  • A. Type of service
  • B. Subscription duration
  • C. Churn (Yes/No)
  • D. Age group
    Answer: C. Churn (Yes/No)

Q55. A churn prediction model can help businesses to:

  • A. Expand manufacturing
  • B. Improve recruitment
  • C. Retain customers
  • D. Lower taxes
    Answer: C. Retain customers

Q56. Which of these features is least useful in churn prediction?

  • A. Customer complaints
  • B. Tenure with company
  • C. Monthly charges
  • D. Customer’s height
    Answer: D. Customer’s height

Q57. High monthly charges may indicate:

  • A. Happy customer
  • B. Potential churn
  • C. No correlation
  • D. Data error
    Answer: B. Potential churn

Q58. Churn prediction models must be:

  • A. Perfectly accurate
  • B. Interpretable and timely
  • C. Expensive and slow
  • D. Written in Java
    Answer: B. Interpretable and timely

Q59. An unbalanced dataset for churn prediction may have:

  • A. Equal churn and no-churn
  • B. Mostly churn cases
  • C. Mostly non-churn cases
  • D. Only new customers
    Answer: C. Mostly non-churn cases

Q60. In churn prediction, recall is more important than:

  • A. F1-score
  • B. Precision (in some cases)
  • C. Support
  • D. Accuracy
    Answer: B. Precision (in some cases)

Q61. Upselling to high-risk churn customers is:

  • A. Profitable strategy
  • B. Bad business logic
  • C. Churn mitigation
  • D. Data cleansing
    Answer: C. Churn mitigation

Q62. A false negative in churn prediction means:

  • A. Predicting churn when it’s not
  • B. Missing an actual churn
  • C. Predicting loyalty correctly
  • D. None of the above
    Answer: B. Missing an actual churn

Q63. The goal of churn prediction is to:

  • A. Hire better agents
  • B. Know which customers to offer retention deals to
  • C. Reduce server load
  • D. Create ads
    Answer: B. Know which customers to offer retention deals to

Q64. In logistic regression used for churn, the output is:

  • A. A class label directly
  • B. A probability between 0 and 1
  • C. A decision tree
  • D. A clustering group
    Answer: B. A probability between 0 and 1

Q65. The feature “Tenure < 1 year” might signal:

  • A. Loyal customer
  • B. High risk of churn
  • C. No relation
  • D. Data error
    Answer: B. High risk of churn

Q66. A high churn rate indicates:

  • A. High retention
  • B. Many new customers
  • C. Many lost customers
  • D. None of the above
    Answer: C. Many lost customers

Q67. Which action can reduce churn?

  • A. Ignoring complaints
  • B. Offering long-term discounts
  • C. Reducing support team
  • D. Increasing fees
    Answer: B. Offering long-term discounts

Q68. Which of the following is a retention strategy?

  • A. Cancelling all low-usage users
  • B. Sending loyalty rewards
  • C. Reducing data collection
  • D. Lowering response times
    Answer: B. Sending loyalty rewards

Q69. Segmenting customers by churn risk helps:

  • A. Reduce ads
  • B. Prioritize outreach
  • C. Increase complaints
  • D. Decrease accuracy
    Answer: B. Prioritize outreach

Q70. Decision tree models for churn are valued because:

  • A. They predict probabilities
  • B. They model time series
  • C. They provide simple if-then rules
  • D. They cluster customers
    Answer: C. They provide simple if-then rules

Q81. KNIME is primarily used for:

  • A. Video editing
  • B. Data analysis and machine learning
  • C. Web development
  • D. Drawing diagrams
    Answer: B. Data analysis and machine learning

Q82. KNIME stands for:

  • A. Knowledge Integration Mining Environment
  • B. Known Intelligent Machine Evaluator
  • C. Kernel in Machine Exploration
  • D. Knowledge Network Infrastructure for Modelling Experiments
    Answer: A. Knowledge Integration Mining Environment

Q83. In KNIME, what is a “workflow”?

  • A. A document file
  • B. A series of connected nodes
  • C. An Excel macro
  • D. A Python script
    Answer: B. A series of connected nodes

Q84. Which file type is commonly used to import data in KNIME?

  • A. .mp3
  • B. .xml
  • C. .csv
  • D. .exe
    Answer: C. .csv

Q85. KNIME nodes are:

  • A. Scripts
  • B. Plugins
  • C. Visual blocks with specific tasks
  • D. Databases
    Answer: C. Visual blocks with specific tasks

Q86. KNIME’s visual interface allows for:

  • A. Text-only inputs
  • B. Manual script writing only
  • C. Drag-and-drop modeling
  • D. GPU rendering
    Answer: C. Drag-and-drop modeling

Q87. What kind of learning does KNIME support?

  • A. Supervised only
  • B. Unsupervised only
  • C. Both supervised and unsupervised
  • D. None
    Answer: C. Both supervised and unsupervised

Q88. KNIME integrates with which programming language?

  • A. Java
  • B. Python
  • C. R
  • D. All of the above
    Answer: D. All of the above

Q89. KNIME workflows can be saved and:

  • A. Shared and reused
  • B. Only run once
  • C. Sent via email as MP3
  • D. Uploaded to Instagram
    Answer: A. Shared and reused

Q90. KNIME excels in:

  • A. High-end graphics
  • B. Text editing
  • C. Visual programming for data science
  • D. Backend development
    Answer: C. Visual programming for data science

Q91. To build a decision tree in KNIME, you must:

  • A. Write Java code
  • B. Connect a decision tree learner and predictor node
  • C. Train a deep learning model
  • D. Use Excel macros
    Answer: B. Connect a decision tree learner and predictor node

Q92. KNIME’s output nodes are used to:

  • A. Hide the data
  • B. Export or visualize results
  • C. Encrypt files
  • D. Delete logs
    Answer: B. Export or visualize results

Q93. KNIME supports integration with:

  • A. Azure
  • B. TensorFlow
  • C. H2O.ai
  • D. All of the above
    Answer: D. All of the above

Q94. What makes KNIME attractive for non-programmers?

  • A. Free food coupons
  • B. Command-line only tools
  • C. Visual, no-code workflow design
  • D. Mandatory Java knowledge
    Answer: C. Visual, no-code workflow design

Q95. You can automate workflows in KNIME by:

  • A. Repeating steps manually
  • B. Using workflow loops and scheduling
  • C. Installing more RAM
  • D. None of the above
    Answer: B. Using workflow loops and scheduling

Q96. Which node would you use to read an Excel file in KNIME?

  • A. Java Reader
  • B. Table Reader
  • C. Excel Reader
  • D. Data Generator
    Answer: C. Excel Reader

Q97. KNIME's key advantage over coding tools is:

  • A. Data loss
  • B. Visual drag-and-drop interface
  • C. More bugs
  • D. Poor compatibility
    Answer: B. Visual drag-and-drop interface

Q98. KNIME Server adds functionality such as:

  • A. No extra features
  • B. Scheduled workflow execution and collaboration
  • C. Video editing tools
  • D. Animation libraries
    Answer: B. Scheduled workflow execution and collaboration

Q99. KNIME offers:

  • A. Paid-only plans
  • B. A free open-source desktop version
  • C. Only cloud-based usage
  • D. Subscription-based email
    Answer: B. A free open-source desktop version

Q100. KNIME is best suited for:

  • A. Video streaming
  • B. Office automation
  • C. Data science and predictive analytics
  • D. Entertainment
    Answer: C. Data science and predictive analytics
Previous Post Next Post