1. Conveying uncertainty in accuracy measurements for machine learning models
  2. When to use Gaussian mixture model?

  3. In R, classification of categorical and continuous variable together in the same dataset

  4. How to design a complex machine learning system where individual classifiers can be retrained without modifying rest of the system?
  5. Is it possible to compare classifiers using Dietterich's 5x2cv paired t-test and Matthews correlation coefficient as an "error" metric?
  6. R: How to determine useful features by Mean Decrease in Accuracy (MDA) in random forest algorithm?
  7. Neural network loss function for hierarchical classification

  8. Probabilistic classification using kernel density estimation
  9. How to deal with a highly unbalanced classification problem?

  10. How can I interpret the results of LSA?
  11. Prediction vs. Classification in neural networks

  12. Is there a Bayesian decision rule based on likelihood alone? And if so, what is its error rate?
  13. Under what circumstance does $P(X|H) = P(H|X)$?

  14. J48 decision trees in weka
  15. Classification vs Linear Regression

  16. How to choose the cutoff probability for a rare event Logistic Regression

  17. R, rpart(), classification tree, cptable$xerror

  18. Classification probability threshold

  19. KNN and K-folding in R

  20. Why will the validation set error underestimate the generalisation error?

  21. Poor performance of binary classification with DCNNs

  22. How to improve classification based on 2D distance between classes

  23. What do the thresholds on x and y axis of ROC curve represent?
  24. Understanding the math behind linear discriminant analysis

  25. How to understand confusion matrix for 3x3

  26. Convolutionalizing fully connected layers to form an FCN in Keras
  27. State of the art results on Cifar-10

  28. Is it possible to design a global image classifier?
  29. Text classification based on keywords

  30. Modulation Signal for a Neural Network

  31. Recommender System + Collaborative filtering without users
  32. Many binary classifiers vs. single multiclass classifier
  33. Machine Learning model for dealing with Curse of Dimensionality

  34. Find patterns in multidimensional time series with few examples per class and possible class duration variation

  35. What is the best way to use Latitude and Longitude features in building a Machine Learning model?
  36. Are Decision Trees well-suited for Sentiment Analysis (of tweets)?
  37. Comprehensive evaluation measure!

  38. I care about the precision of a few of the classes, how would I write a custom XGBoost objective function?

  39. How to make predictions using multiclass unbalanced data?
  40. Comparing supervised text classification algorithms with unlabeled documents from web

  41. Feed guess features into unsupervised learning classification?

  42. Reversed Naive Bayes - likelihood and parameter estimation
  43. Bayes risk for Bayesian classifier with multivariate Gaussian

  44. SVM Kernel confusion

  45. Supervised learning: setting labels on sliding windows of sensor data
  46. Information gain and mutual information: different or equal?
  47. How can I implement a CRF feature function?
  48. Algorithm recommendation: Short string classification/matching (fuzzy string matching or machine learning ?)

  49. Time Series Classification?
  50. How to quantify prediction error of a continuous target variable?
  51. Why don't LR and linear classification models work on one-hot encoded data?

  52. Giving weights to classes in classification problem to ensure correct prediction of certain classes

  53. How to find the accuracy of regression data using an XGBoost model?

  54. Viterbi Algorithm for non probabilistic classifier
  55. Are Voronoi diagrams used in kNN algo implementations?
  56. Help interpreting formula for multi-class hinge loss

  57. Pros and cons of Logistic Regression, Naive Bayes, Random Forest, Tree and kNN
  58. Data augmentation and effective class imbalance

  59. How to improve classification performance based on multiple known classification results

  60. Classification with confidence scores: is regression ok?

  61. Alternative to Shannon's entropy when a probability is equal to zero

  62. Nature of variations captured by first few principal components.
  63. On what basis can we combine levels in a factor variable when the target variable is binary?

  64. How to draw confusion matrix for this classifier?

  65. How to solve a multi-class and multi-label problem?

  66. Creating a predictive model based on past customer data.
  67. Erratic learning curves diagnosis?

  68. What train/test accuracy to expect from various classifiers on 6-multiclass problem?

  69. Setting up a MLP for binary classification with tensorflow

  70. How to plot boundary line of binary classification algorithm

  71. What is the difference between explicit and implicit mapping in SVM?
  72. How to compare the accuracy of two different models using statistical significance

  73. Is there something fundamentally wrong with using the same features/learning algorithm for the same data set given different labels?

  74. Estimating performance of a binary classifier
  75. How to compute F-measure and accuracy for repeated cross-validation

  76. One class of Ordinal DV values has too few observations - best way to address

  77. Optimal number of components in a Gaussian mixture

  78. Fisher's Exact Test construction
  79. Precision and recall are equal when the size is same

  80. How to select predictor variables for a classification model?

  81. Is f-measure synonymous with accuracy?
  82. Classification using categorical and text data

  83. In a real application, how to improve the CNN performance?
  84. Applying the Bias-Variance tradeoff to calculate k in K-nearest-neighbour
  85. Does the Discriminatory Power of a Set of Variables Necessarily Depend on the Correlation or Mutual Information with the Classifier?

  86. Calculating Area Under Curve

  87. Is AUC the probability of correctly classifying a randomly selected instance from each class?
  88. Train a Neural Network to distinguish between even and odd numbers

  89. Is KNN a discriminative learning algorithm?
  90. Random Forest training ROC score monotonically increasing with max_depth

  91. How to subset alternatives in nested multinomial logistic regression?

  92. Is the log loss function $f(w) = y_t \log(y_p) + (1 - y_t) \log(1 - y_p)$ convex in $w$?

  93. How to identify linearly separable datasets
  94. Can't get Accuracy above 15% on CIFAR-10 dataset

  95. Regarding the size of training data for building classifier
  96. Optimizing AUC vs. log loss in binary classification problems

  97. Classification when each level has probability assigned to it

  98. How to not overlook rare but important features when preventing over-fitting in a decision tree?

  99. K-means and maximum likelihood!

  100. Random Forests - Regression or Classification?