Using Pandas in Python — A Practical Guide

Umesh S
2 min readFeb 9, 2023

--

A step-by-step guide to using Pandas in Python, including a simple example that demonstrates how to use the library.

Step 1: Installing Pandas

Before you can start using Pandas, you need to install it. You can install Pandas by running the following command in your terminal or command prompt:

pip install pandas

Step 2: Importing Pandas

Once you have installed Pandas, you need to import it into your Python script before you can start using it. You can import Pandas using the following code:

import pandas as pd

Step 3: Creating a DataFrame

The main data structure in Pandas is the DataFrame, which is a two-dimensional labeled data structure. You can create a DataFrame from a variety of sources, including dictionaries, lists, and other Pandas DataFrames.

Here is an example of how to create a DataFrame from a dictionary:

data = {'name': ['John', 'Jane', 'Jim', 'Joan'],
'age': [32, 28, 41, 35],
'country': ['USA', 'Canada', 'UK', 'Australia']}

df = pd.DataFrame(data)

Step 4: Viewing Data

Once you have created a DataFrame, you can view the data in it using the head or tail methods. The head method displays the first five rows of the DataFrame, while the tail method displays the last five rows.

Here is an example of how to use the head method:

print(df.head())

This will output the following:

   name  age  country
0 John 32 USA
1 Jane 28 Canada
2 Jim 41 UK
3 Joan 35 Australia

Step 5: Manipulating Data

Pandas provides a wide range of functions for manipulating data in a DataFrame, including filtering, sorting, and aggregating.

Here is an example of how to filter a DataFrame based on a condition:

df_filtered = df[df['age'] > 30]

This will return a new DataFrame that only contains rows where the age column is greater than 30.

Step 6: Saving Data

You can save a Pandas DataFrame to a variety of file formats, including CSV, Excel, and SQL.

Here is an example of how to save a DataFrame to a CSV file:

df.to_csv('example.csv', index=False)

In conclusion, Pandas is a powerful and easy-to-use library for data manipulation and analysis in Python. With its user-friendly data structures, flexible functions, and robust capabilities, it is a must-have tool for anyone who works with data in Python.

--

--

Umesh S
Umesh S

Written by Umesh S

Experienced Software Engineer committed to helping others grow and succeed.

No responses yet