Data Science
Data science professionals, Machine Learning specialists, and those interested in learning more about the field
Solved Questions
Unsolved Questions
  1. Solving linear classification task with deep network
  2. NLP for appartement ads
  3. What it Would be easier Building a Deep Net From Scratch or Using an existing Architecture?

  4. Amount of data needed and hypothesis for SVD
  5. Is supervised machine learning by definition predictive?
  6. How to deal with a sparse matrix when using a perceptron based recommender system?

  7. Updating the weights of the filters in a CNN
  8. Use 1 or 2 norm for Voronoi vector quantization?

  9. How to search for an optimal dithering pattern?

  10. How does kera calculate accuracy for multi label classification?

  11. What are limitations of decision tree approaches to data analysis?
  12. Explain Binary Classification with output 0.5 (True)

  13. Do I need an Artificial Intelligence API?
  14. Machine Learning - Same impurity values
  15. How to detect anomalous points

  16. Predicting Missing Features

  17. How to apply SelectKBest in azure machine learning

  18. Publicly Available Datasets
  19. How is that possible that a reward function depends both on the next state and an action from current state?

  20. Datasource for regression model prediction : Machine Learning
  21. Can dropout and batch normalization be applied to convolution layers

  22. Knime : Scatter Plot

  23. Columns with no (or nearly no) differences between rows worth keeping?
  24. Spliting training data into multiple variables using R
  25. Data amount for a very simple chatbot

  26. What is the purpose of untrainable weights in Keras

  27. Multi-class text classification with LSTM in Keras

  28. Reinforcement Learning - What's the formula for the value function
  29. Word embedding vectors for keyphrase extraction
  30. Multi GPU in keras
  31. Idf values of English words

  32. How to analyse 2 data set to make an categorize and recommendation model

  33. Pre-train using sigmoid and train using ReLU?

  34. How to treat outliers in a time series dataset?

  35. Training the parameters of a Restricted Boltzman machine

  36. Quantifying 'growth friction' when projecting target goals

  37. Sample weights changing predictions even if all weights are the same

  38. Checking Correlation of Categorical variables in SPSS
  39. How to print out to a file using Standford Classifier

  40. Some questions about GoogleNet paper

  41. How to get stanford universal dependencies in python NLTK in specific format

  42. How to generate training data for OCR
  43. What is the appropriate learning algorithm?

  44. Using scikit-learn FeatureHasher
  45. SMOTE and multi class oversampling
  46. Predicting three related scores with one joint model

  47. how can i sum first day value of each id together?

  48. Distance measure for ternary feature

  49. svm KKT condition violation that alpha choice in SMO
  50. What is the difference between data-driven methods and machine learning?
  51. How to group data and plot line graphs

  52. Parse and Scrape ecommerce websites with a generalizable and scalable approach
  53. How should I preprocess WORD ID?
  54. What is the difference between "expected return" and "expected reward" in the context of RL?

  55. How can autoencoders be used for clustering?
  56. What ML algorithm can I use for building a "recommended" list for players?
  57. What is representation in optical character recognition?
  58. How do I go from number data to string data (NN)?

  59. How to make unstructured data to structured data?
  60. High, constant training loss with CNN

  61. Name of ML discipline for trainable clusterer - supervised clustering, or group ID assignment

  62. Parameter estimation: reduce time

  63. How to deal with time series which change in seasonality or other patterns?
  64. How to replace levels with certain value?

  65. How to think about prediction error that is not convex in hyperparameter, or over the course of training

  66. Notion of cluster centers and cluster comparison in Density Based Algorithms
  67. Pass data to CNN with multiple outputs in eras
  68. Keyword extraction using deep learning

  69. Deconvolution vs Sub-pixel Convolution
  70. Age range prediction according to browsing history
  71. Clustering algorithms for high dimensional binary sparse data
  72. What exactly is a class in kNN search?
  73. About training and cross validation on a time series problem
  74. Assessing Group Similarities and Dissimilarities Post PCA

  75. How differential semi-gradient Sarsa updates estimated average reward?
  76. Text annotating process, quality vs quantity?
  77. Train/Test Split after perform SMOTE

  78. Using standard ML models for modeling a derivative when the data set only contains function values

  79. Monte Carlo and K-fold cross-validation
  80. One hot encoding alternatives for large categorical values?

  81. Business exception reporting
  82. What is the best technique/algorithm to compare trees changes?

  83. Filter unwanted terms

  84. Using ML to create unique descriptors?
  85. Filtering periodic data noise

  86. Classify future performance of customer
  87. How do you train a machine learning model to learn how to adjust images (contrast, brightness, saturation, crop etc.) from manually adjusted images?
  88. Multi-Source Time Series Data Prediction
  89. How to determine feature importance while using xgboost in pipeline?

  90. Documentation for package rnn in R

  91. AttributeError: 'numpy.ndarray' object has no attribute 'predict'

  92. Tensorflow Train Cnn on Multi-Gpu With Dataset api Does Not work

  93. sklearn SVM really slow
  94. What is LSTM, BiLSTM and when to use it?
  95. What are possible approaches to deal with unseparable data even after PCA?
  96. Error when checking : expected dense_1_input to have shape (None, 5) but got array with shape (200, 1)
  97. A/B testing: How to calculate p-value on post test segments?
  98. LVQ (Learning vector quantization) not starting on MNIST dataset?

  99. How to approach creating a question answering bot on a specific domain?

  100. Time series prediction without sliding window