Automated Feature Engineering

Overview

Automated feature engineering is a powerful tool for reducing the amount of manual work needed in order to build predictive models. Instead of a data scientist spending days or weeks coming up with the best features to describe a dataset, we can use tools that approximate this process. One library I’ve been working with to implement this step is FeatureTools. It takes inspiration from the automated feature engineering process in deep learning. However, it is meant for shallow learning problems where you already have structured data but need to translate multiple tables into a single record per user.

In our pre-configured execution environment, gcc and Python3.7 are already installed along with the required libraries. To skip local installation instructions and get on with the applications, click here.

Installing libraries

The library can be installed as follows:

Get hands-on with 1200+ tech skills courses.