- shuffling data in the mini-batch training of neural network
- Dealing with model assumption violation (homogeneity of regression coefficients for ANCOVA)
- ACF of MA(q) cuts off after lag q - but isn't it also AR(∞)?
- R BSTS prediction errors
- Interpreting test results on log-transformed data
- Detrending a time series
- R - Confused on Residual Terminology
- Sampling distribution of regression coefficients
- Do you know any 3D engines written in Objective-C for iOS?
- How can I create a 'SdkTrayManager' without 'OIS::Mouse'?
- Is this scheduling operation a slowdown for the CPU, or is it a benefit?
- Is networked gameplay inherently hard to debug and create?
- Are there any good ways to know if a resource is patented trademark or illegal to use?
- Is it worth taking the time and making a game scripting engine instead of directly coding?
- making a sourdough starter in desert like conditions
- error I can not see my webpage
- Shoot bows faster
- Exiting 'Print Layout' when viewing a read-only Google Doc
- Google Document web-layout view mode similarly in Microsoft's Word
Naive Bayes Should generate prediction given missing features (scikit learn)
Seeing that Naive Bayes uses probability to make a prediction, and treats features as being conditionally independent of each other, then it makes sense that the model can still make a prediction given that there are some features missing in the test data.
I know that it is common practice to impute missing data, but why do this when Naïve Bayes should be able to make a prediction given that there are some features missing?
Can this be implemented in sci-kit learn? I tried a test set with less features, and got a ValueError as the shapes are not aligned.
So theoretically this is possible, but is it possible in scikit learn?
Your question is sensible. The way in which posterior probability is calculated in the classical Naive Bayes classifier (in sklearn) is like summation of the conditional probabilities of the all the features in the dataset. Even though the features are treated as conditionally independent, to learn the classification probability all the feat
Your question is sensible. The way in which posterior probability is calculated in the classical Naive Bayes classifier (in sklearn) is like summation of the conditional probabilities of the all the features in the dataset. Even though the features are treated as conditionally independent, to learn the classification probability all the features are always used in this setup. Once the model has been learned you still all those features to calculate the posterior for a new observation. The conditional independence is just an assumption that is taken to make the statistics and math obey the rules and work.
But slightly modifying the way in which the posterior is calculated you can use Bayesian approach to make predictions even with the absence of certain features. Using Bayesian approach to make predictions in the absence of certain features is still an ongoing work. You may want to have a look at this paper in which Bayesian approach is applied to astronomy to do classification with2017-03-20 23:57:20