- Introduction
- CatBoost vs XGBoost Battle 1
- CatBoost vs XGBoost Battle 2
- CatBoost vs XGBoost Battle 3
- Let's look at CatBoostClassifier and CatBoostRegressor
- Conclusion
What you'll learn
- Learn how to use CatBoost in regression and classification with Python
Description
XGBoost has long been one of the most powerful boosted models available, and now here comes CatBoost. Let's explore how it compares to XGBoost using Python, then try CatBoost on both a classification dataset and a regression one. Let's have some fun!
Part 1
We're going to start by unleashing XGBoost and CatBoost on an independent version of the Titanic data set: the ship's manifest of those who did and didn't survive the tragic sinking in the North Atlantic Ocean. The ship sank in 1912 after hitting an iceberg on its maiden voyage to New York. You have probably already used this data set, as it is extremely predictive: basically, women, children and the rich survived, while men and the poor mostly didn't.
Part 2
In the second part, we'll model both classification and regression with CatBoost, using the Titanic data for classification and the Boston housing data for regression. I'll also introduce you to a cool tool, Pandas Profiling, for quick EDAs.
Please go out and use this model in a Kaggle competition. Get an account if you haven't already and experiment: sometimes follow the rules, sometimes don't. Remember that data science is very new, so we're still inventing things as we go, and these new models allow us to explore a little further each time!
About the instructors
- 4.43 Rating
- 51717 Students
- 11 Courses
Manuel Amunategui
Data Scientist & Quantitative Developer
Data scientist with over 20 years of experience in the tech industry, MAs in Predictive Analytics and International Administration, author of Monetizing Machine Learning and The Little Book of Fundamental Indicators, founder of FastML, reached top 1% on Kaggle and awarded "Competitions Expert" title, taught over 20,000 students on Udemy, and VP of Data Science at SpringML.
From consulting in machine learning, healthcare modeling, 6 years on Wall Street in the financial industry, and 4 years at Microsoft, I feel like I’ve seen it all. And this has opened my eyes to the huge gap in educational material on applied data science. Like I say:
"It just ain’t real 'til it reaches your customer’s plate"
I am a startup advisor and available for speaking engagements with companies and schools on topics around building and motivating data science teams, and all things applied to machine learning.
Reach me at [email protected]
Student feedback
Reviews
It was easy to follow. Introducing pandas profiling was a big surprise.
Good Course
Great course and well explained, with a steady pace to keep interest but not be boring at the same time.
Concise. Informative. Right to the point on implementation!