Exercise: Anonymizing and Encrypting the Data
Explore how to anonymize sensitive credit card transaction data by masking numbers and encrypt account balances using symmetric key encryption. This lesson helps you apply data protection techniques within ETL transform steps using Python and pandas.
Data anonymization
Consider a dataset containing credit card transaction details of a bank's customers. The transactions are ATM deposits and withdrawals collected in real time as they happen. When a customer uses their credit card to withdraw or deposit money from an ATM, the transaction details are recorded in the dataset and stored in the bank’s database.
Our job is to anonymize the dataset to protect the customer’s credit card data. The dataset contains the following columns:
Customer ID: A unique identifier for each ...