Are Kaggle Competitions Worth It? Ponderings of a Kaggle Grandmaster

I would not have a data science career without Kaggle. So if you are looking for a blog post bashing Kaggle, this is not the place. Competing on Kaggle is worth it! It will teach you a lot about machine learning and give you a project to talk about in interviews, which can even help you get a job! That said, I am not a radical who thinks Kaggle is the ultimate thing that everyone must do in order to become a data scientist....

April 5, 2023 · 15 min · Mario Filho

How Writing Journals Helped Me Become a Better Data Scientist

I have been using daily machine learning development journals for the past 5 years. These are simple diaries that help me organize my thoughts, come up with new ideas and track my progress. I have found them to be an invaluable tool for helping me improve as a data scientist, so I decided to share my experience, as you may find it useful too. 🙂 What is a Daily Machine Learning Journal?...

May 19, 2022 · 8 min · Mario Filho

How To Use Machine Learning In Your Company Without Hiring a Full-Time Data Scientist

The days when you needed to hire a full-time data scientist to use machine learning to solve a business problem in your company are long gone. Now, machine learning is more accessible than ever and there are several ways you can start using it in your business without breaking the bank. Here are a few steps you should take to build your first ML-powered solution. Identify A Measurable Business Problem Start by identifying a business problem that machine learning could help solve....

May 17, 2022 · 7 min · Mario Filho

Can Gradient Boosting Learn Simple Arithmetic?

During a technical meeting a few weeks ago, we had a discussion about feature interactions, and how far we have to go with them so that we can capture possible relationships with our targets. Should we create (and select) arithmetic interactions between our features? This is not something we usually learn in machine learning courses, but it's a very important topic to better understand how our models work. A few years ago I remember visiting a website that showed how different models approximated these simple operations....

January 20, 2020 · 4 min · Mario Filho