Prerequisites and Required Libraries

Learn how to install libraries for a Github scraping application.

In this chapter, we’ll build a web app using Streamlit which scrapes GitHub for user information. It displays the basic information and some of their recent repositories. We’ll use the Beautiful Soup library for web scraping. The chapter is divided into two sections. In the first section we’ll explore web scraping using Beautiful Soup. In the second section we’ll talk about using the scraped data and Streamlit to build a web app.

Install the required libraries

First, we need to install the required libraries. It is recommended that you create a virtual environment before installing the libraries.

python -m virtualenv venv  # setup your virtual environment
venv/Scripts/activate # activate the virtual environment
pip install beautifulsoup4 , streamlit

Import the libraries

Import the following libraries into your Python program:

from bs4 import BeautifulSoup
import requests
import pandas as pd