Finding the number of Institutes from each state

Explore how to extract and count the number of institutes from each state in a university ranking dataset using Bash shell commands. Learn to isolate relevant columns, sort data, and use uniq to count unique entries, gaining practical skills in text processing and data analysis.

We'll cover the following...

Do you want to know more?

Here, the command-line option -f specifies which field (column) to extract or cut out from the file and the option (d,) tells that we want delimit the cuts by comma (,). When you run that command, you should see that the output consist only of lines such as university names and states. Note that, despite its name, the cut command does not modify the original file it acts on. Now onto the last part. We would like to count how many unis came from each state. However, this is a complex procedure and there isn’t one command that can do all that; we will have to use two commands. Here we need the command uniq -c to count (hence the -c ) how many unique appearances of each state. However, uniq -c requires the input to be sorted, so the first step is to sort the list of universities and states. We can do this very easily with a command that is conveniently called sort :

1.Course Introduction

2.Project 1: Analyzing the 'US News' University Ranking Data

3.Project 2: Facebook Data Mining

4.Project 3: Australian Cities Crime Statistics

5.Project 4: Shakespearean-era plays and poems data mining

6.Bash Tutorials

7.REGEX Tutorials

8.AWK Tutorials

9.SED, GREP and Find Tutorials

10.Beyond the Text Files! Enter into the Big Data Landscape - Concepts

11.Conclusion

Finding the number of Institutes from each state

Do you want to know more?