How to implement a LightGBM classifier in Python

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn import metrics
from sklearn.metrics import accuracy_score, classification_report
import lightgbm as lgb
# Load the breast cancer dataset
data = load_breast_cancer()
# Extract the features (X) and target (y)
X = data.data
y = data.target
# Splitting the dataset
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.20)
# Training the model
model = lgb.LGBMClassifier(n_estimators=100, max_depth=6, learning_rate=0.1, objective='binary', verbosity=-1)
# Fit the model on the training data
model.fit(X_train, y_train)
# Make predictions on the test data
y_pred = model.predict(X_test)
# Calculate and print the accuracy of the model
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy: {:.2f}%".format(accuracy * 100))
# Print the classification report of the model
report = classification_report(y_test, y_pred)
print("Classification Report:\n", report)

Code explanation:

Line 8: We load the breast cancer dataset from sklearn and store it in the data variable.
Lines 11–12: We extract the feature matrix X and the target vector y from the loaded dataset. X contains the input data, and y contains the binary classification labels.
Line 15: We split the dataset into training (X_train and y_train) and testing (X_test and y_test) sets using the train_test_split() function. Here, 20% of the data is reserved for testing, and 80% is used for training.
Line 18: We create an instance of the LGBMClassifier class with specified parameters.
Line 21: We train the model on the training data using the fit() method.
Line 24: The trained model is used to make predictions on the test data.
Lines 27–28: We calculate the accuracy of the model’s predictions by comparing them to the true labels in the test set. The accuracy is printed as a percentage.
Lines 31–32: We generate and print the classification report for the model.

Free AI Mock Interviews

Coding Interview

Coding PatternsFree Interview

Gain insights and practical experience with coding patterns through targeted MCQs and coding problems, designed to match and challenge your expertise level.

System Design

YouTubeFree Interview

Learn to design a video streaming platform like YouTube by tackling functional and non-functional requirements, core components, and high-level to detailed design challenges.

Free Resources

Argument	Description
`n_estimators`	Specifies the number of boosting rounds or iterations
`max_depth`	Sets the maximum number of leaves in one tree (It’s important for controlling the complexity of the model and avoiding overfitting)
`learning_rate`	Defines the increment size during each iteration as it converges towards minimizing the loss function
`objective`	Specifies the learning task and the corresponding objective function

How to implement a LightGBM classifier in Python

Installation

Implementing a LightGBM classifier

Import the libraries

Load the dataset

Understand the parameters

LGBMClassifier Parameters

Train the model

Make a prediction

Evaluate the model

Example

Code explanation: