Feature Engineering for Machine Learning

Learn imputation, variable encoding, discretization, feature extraction, how to work with datetime, outliers, and more.

4.68 (3205 reviews)
Udemy
platform
English
language
Data Science
category
instructor
Feature Engineering for Machine Learning
22,869
students
13.5 hours
content
Apr 2024
last update
$89.99
regular price

What you will learn

Learn multiple techniques for missing data imputation.

Transform categorical variables into numbers while capturing meaningful information.

Learn how to deal with infrequent, rare, and unseen categories.

Learn how to work with skewed variables.

Convert numerical variables into discrete ones.

Remove outliers from your variables.

Extract useful features from dates and time variables.

Learn techniques used in organizations worldwide and in data competitions.

Increase your repertoire of techniques to preprocess data and build more powerful machine learning models.

Why take this course?

Welcome to Feature Engineering for Machine Learning, the most comprehensive course on feature engineering available online. In this course, you will learn about variable imputation, variable encoding, feature transformation, discretization, and how to create new features from your data.


Master Feature Engineering and Feature Extraction.

In this course, you will learn multiple feature engineering methods that will allow you to transform your data and leave it ready to train machine learning models. Specifically, you will learn:


  • How to impute missing data

  • How to encode categorical variables

  • How to transform numerical variables and change their distribution

  • How to perform discretization

  • How to remove outliers

  • How to extract features from date and time

  • How to create new features from existing ones


Create useful Features with Math, Statistics and Domain Knowledge

Feature engineering is the process of transforming existing features or creating new variables for use in machine learning. Raw data is not suitable to train machine learning algorithms. Instead, data scientists devote a lot of time to data preprocessing. This course teaches you everything you need to know to leave your data ready to train your models.


While most online courses will teach you the very basics of feature engineering, like imputing variables with the mean or transforming categorical variables using one hot encoding, this course will teach you that, and much, much more.


In this course, you will first learn the most popular and widely used techniques for variable engineering, like mean and median imputation, one-hot encoding, transformation with logarithm, and discretization. Then, you will discover more advanced methods that capture information while encoding or transforming your variables to improve the performance of machine learning models.


You will learn methods like the weight of evidence, used in finance, and how to create monotonic relationships between variables and targets to boost the performance of linear models. You will also learn how to create features from date and time variables and how to handle categorical variables with a lot of categories.


The methods that you will learn were described in scientific articles, are used in data science competitions, and are commonly utilized in organizations. And what’s more, they can be easily implemented by utilizing Python's open-source libraries!

Throughout the lectures, you’ll find detailed explanations of each technique and a discussion about their advantages, limitations, and underlying assumptions, followed by the best programming practices to implement them in Python.


By the end of the course, you will be able to decide which feature engineering technique you need based on the variable characteristics and the models you wish to train. And you will also be well placed to test various transformation methods and let your models decide which ones work best.


Step-up your Career in Data Science

You’ve taken your first steps into data science. You know about the most commonly used prediction models. You've even trained a few linear regression or classification models. At this stage, you’re probably starting to find some challenges: your data is dirty, lots of values are missing, some variables are not numerical, and others extremely skewed. You may also wonder whether your code is efficient and performant or if there is a better way to program. You search online, but you can’t find consolidated resources on feature engineering. Maybe just blogs? So you may start to wonder: how are things really done in tech companies?


In this course, you will find answers to those questions. Throughout the course, you will learn multiple techniques for the different aspects of variable transformation, and how to implement them in an elegant, efficient, and professional manner using Python. You will leverage the power of Python’s open source ecosystem, including the libraries NumPy, Pandas, Scikit-learn, and special packages for feature engineering: Feature-engine and Category encoders.


By the end of the course, you will be able to implement all your feature engineering steps into a single elegant pipeline, which will allow you to put your predictive models into production with maximum efficiency.


Leverage the Power of Open Source

We will perform all feature engineering methods utilizing Pandas and Numpy, and we will compare the implementation with Scikit-learn, Feature-engine, and Category encoders, highlighting the advantages and limitations of each library. As you progress in the course, you will be able to choose the library you like the most to carry out your projects.

There is a dedicated Python notebook with code to implement each feature engineering method, which you can reuse in your projects to speed up the development of your machine learning models.


The Most Comprehensive Online Course for Feature Engineering

There is no one single place to go to learn about feature engineering. It involves hours of searching on the web to find out what people are doing to get the most out of their data.


That is why, this course gathers plenty of techniques used worldwide for feature transformation, learnt from data competitions in Kaggle and the KDD, scientific articles, and from the instructor’s experience as a data scientist. This course therefore provides a source of reference where you can learn new methods and also revisit the techniques and code needed to modify variables whenever you need to.


This course is taught by a lead data scientist with experience in the use of machine learning in finance and insurance, who is also a book author and the lead developer of a Python open source library for feature engineering. And there is more:


  • The course is constantly updated to include new feature engineering methods.

  • Notebooks are regularly refreshed to ensure all methods are carried out with the latest releases of the Python libraries, so your code will never break.

  • The course combines videos, presentations, and Jupyter notebooks to explain the methods and show their implementation in Python.

  • The curriculum was developed over a period of four years with continuous research in the field of feature engineering to bring you the latest technologies, tools, and trends.


Want to know more? Read on...

This comprehensive feature engineering course contains over 100 lectures spread across approximately 10 hours of video, and ALL topics include hands-on Python code examples that you can use for reference, practice, and reuse in your own projects.


REMEMBER, the course comes with a 30-day money-back guarantee, so you can sign up today with no risk.


So what are you waiting for? Enrol today and join the world's most comprehensive course on feature engineering for machine learning.

Screenshots

Feature Engineering for Machine Learning - Screenshot_01Feature Engineering for Machine Learning - Screenshot_02Feature Engineering for Machine Learning - Screenshot_03Feature Engineering for Machine Learning - Screenshot_04

Reviews

Antony
February 15, 2024
A very well structured course, meets the high standards of Soledad's other courses consistently, and I also liked that she had patiently answered student's questions in the Q&A
Eduard
February 8, 2024
Feature Engine met and even exceeded all my expectations. Needless to say that the author is very knowledgeable in feature engineering since she is the creator of this must-have library. It took a while in the beginning to get used with the accent, but after that is was a breeze. Highly recommend!!!
Suhas
February 2, 2024
I loved very much this course. There is so much to do with FE nd it can improve the performance of a model.
Nanda
January 10, 2024
Course gives a lot of information, but I miss a code-along or exercises part in which I can work with the techniques myself. Therefore I find the course more helpful for searching a solution when working on a project then for learning the principles of feature engineering.
Ethanxsun
December 20, 2023
The lecture provides excellent code samples and comprehensive explaination on the subject for a deep-dive understanding of feature engieering. Thank you!
Prashant
December 13, 2023
Excellent Explanation , I learnt a lot about feature engineering .If assignment would have added will help more to grasp the concept .
Michele
November 20, 2023
A very comprehensive approach to the subject. I wish all the courses on Udemy were as well made and thought of as this one. Thank you so much
Enrique
November 17, 2023
Even the instructor created a library only for this course, amazing. I don't like to do Feature- Engineering, but this course helped me a lot with my job and professional profile, thank you Soledad, i will be like you in the future.
鬼頭昌哉
October 29, 2023
Great coures! I have learned so much about feature engineering. I'm so impressed to have learned the practical way of feature engineering. Thank you.
VINAY
May 25, 2023
Sole is highly knowledgeable Data Scientist and all of her courses contains quality PPT's, Datasets and Codes. If anyone wants to master the ML pipelines then Sole is one-stop solution ranging from Development to Deployment. I have really enjoyed all of the video series and bug free tutorials. Looking forward to learn more through her courses. Regards VINAY
Quinn
May 24, 2023
This course couldn't get any better. The preparation that went into it is outstanding and the instructor knows more about the subject than any of the dozen data science instructors I've had on Udemy. Please purchase!
Ibrahim
May 19, 2023
I would like to see more use cases on Feature Creation in the course. Mainly, in which, the situation that features is created because of business needs. Also more deeply diving into Feature Engineering in general.
MAH
May 16, 2023
Soledad has the most in-depth feature engineering training I came across so far. You will learn about feature engineering problems and solutions using functions from sklearn you may have never thought of before. You will also learn about feature-engine, which is more user friendly, with more intuitive function names (relating to the function's purpose) and parameters. Soledad is also very active in the Q&A section and provided helpful responses to questions I had (some are new, and some are questions already asked by other people).
Tomáš
April 11, 2023
It's great, same as all other courses from Sole. Clear explanations, good choice of topics, I learnt quite a few things I dint know. Would recommend.
Shivaji
March 28, 2023
This course is very good in terms of learning some new techniques of Feature Engineering. I like the most is the use of feature-engine library, which helps to utilize more functionality in our code. And also how to setup Feature Engineering Pipeline is very well explained in this course.

Charts

Price

Feature Engineering for Machine Learning - Price chart

Rating

Feature Engineering for Machine Learning - Ratings chart

Enrollment distribution

Feature Engineering for Machine Learning - Distribution chart
1304050
udemy ID
7/25/2017
course created date
11/19/2019
course indexed date
Bot
course submited by