Q: For a one-hot encoded feature, what can you do if new data contains categories that weren't seen during training?
A: Set handle_unknown='ignore' to encode new categories as all zeros.

P.S. If you know all possible categories that might ever appear, you can instead specify the categories manually. handle_unknown='ignore' is useful specifically when you don't know all possible categories.

New tips every TUESDAY and THURSDAY!

Watch all tips: https://www.youtube.com/playlist?list=PL5-da3qGB5ID7YYAqireYEew2mWVvgmj6
Code for all tips: https://github.com/justmarkham/scikit-learn-tips
Get tips via email: https://scikit-learn.tips


=== WANT TO GET BETTER AT MACHINE LEARNING? ===

1) WATCH my video series: https://www.youtube.com/playlist?list=PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A

2) ENROLL in my courses: https://www.dataschool.io/ml-courses/

3) LET'S CONNECT!
- Newsletter: https://www.dataschool.io/subscribe/
- Twitter: https://twitter.com/justmarkham
- Facebook: https://www.facebook.com/DataScienceSchool/
- LinkedIn: https://www.linkedin.com/in/justmarkham/