ModuleNotFoundError: No module named ‘pandas’

In Python, the ModuleNotFoundError: No module named ‘pandas’ error occurs when you try to import the pandas module without having installed the package, or when the package is installed in a different environment from the one running your code.

In this tutorial, let’s look at how to install the pandas module correctly on different operating systems and resolve the ModuleNotFoundError: No module named ‘pandas’ error.

What is ModuleNotFoundError: No module named ‘pandas’?

There are several reasons why you may get the ModuleNotFoundError: No module named ‘pandas’ error:

  • Using the module without installing the pandas package.
  • The IDE is configured to use the wrong Python interpreter.
  • You are using a virtual environment and the pandas module is not installed inside it.
  • The pandas package is installed for a different version of Python than the one currently in use.
  • A variable or file has the same name as the module (pandas) and shadows it.
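Most of the causes above come down to whether the interpreter that runs your code can actually locate pandas. A minimal sketch of that check, using only the standard library (the helper name diagnose is hypothetical, not part of pandas or pip):

```python
import importlib.util
import sys


def diagnose(module_name: str = "pandas") -> str:
    """Return a one-line report on whether module_name can be located."""
    # find_spec returns None when a top-level module cannot be found
    spec = importlib.util.find_spec(module_name)
    if spec is None:
        return f"{module_name} not found for interpreter {sys.executable}"
    return f"{module_name} found at {spec.origin}"


if __name__ == "__main__":
    print(diagnose("pandas"))
```

If the reported location points at a file in your own project rather than a site-packages directory, a local file is probably shadowing the real package.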

If you are getting an error installing pip, check out “pip: command not found” to resolve the issue.

How to fix ModuleNotFoundError: No module named ‘pandas’?

pandas is not a built-in module (it doesn’t come with the default Python installation); you need to install it explicitly using the pip installer before you can use it.

pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.

You can fix the error by installing the pandas module: run the pip install pandas command in your terminal/shell.

You can verify that the package is installed correctly by running the pip show command in the terminal/shell.

This will provide the details of the installed package, including the version number, license, and the path where it is installed. If the module is not installed, you will get a warning message in the terminal stating WARNING: Package(s) not found: pandas.

pip show pandas

Output

Name: pandas
Version: 1.4.3
Summary: Powerful data structures for data analysis, time series, and statistics
Home-page: https://pandas.pydata.org
Author: The Pandas Development Team
Author-email: pandas-dev@python.org
License: BSD-3-Clause
Location: c:\personal\ijs\python_samples\venv\lib\site-packages
Requires: numpy, pytz, python-dateutil
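The same check can be done from inside Python, which confirms the result for the exact interpreter running your script. The following is a sketch assuming Python 3.8 or later, where importlib.metadata is part of the standard library; the helper name installed_version is hypothetical.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version("pandas")
    if version is None:
        print("pandas is not installed in this environment")
    else:
        print(f"pandas {version} is installed")
```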

Solution 1 – Installing and using the pandas module in a proper way

Based on the Python version and the operating system you are running, run the relevant command to install the pandas module.

# If you are using Python 2 (Windows); recent pandas releases support Python 3 only
pip install pandas

# If you are using Python 3 (Windows)
pip3 install pandas

# If pip is not on the PATH environment variable
python -m pip install pandas

# If you are using Python 2 (Linux); recent pandas releases support Python 3 only
sudo pip install pandas

# If you are using Python 3 (Linux)
sudo pip3 install pandas

# If you have to use easy_install (deprecated; prefer pip where possible)
sudo easy_install -U pandas

# On CentOS (the package name and repository may vary by release)
sudo yum install python3-pandas

# On Ubuntu
sudo apt-get install python3-pandas

# If you are installing it in Anaconda
conda install -c anaconda pandas
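When several Python versions are installed, it can be unclear which one a bare pip command belongs to. One way around this is to call pip through the interpreter that runs your code. The sketch below builds such a command; the function names are hypothetical, and the install step is left for you to call explicitly.

```python
import subprocess
import sys
from typing import List


def pip_install_command(package: str) -> List[str]:
    """Build a pip command bound to the interpreter running this script."""
    return [sys.executable, "-m", "pip", "install", package]


def install(package: str) -> None:
    """Run pip for the current interpreter; raises CalledProcessError on failure."""
    subprocess.check_call(pip_install_command(package))


if __name__ == "__main__":
    # Only prints the command; call install("pandas") to actually run it
    print(" ".join(pip_install_command("pandas")))
```

Because the command starts with sys.executable, the package lands in the same environment that will later import it.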

Once you have installed the pandas module, you can import it in your code and use it as shown below.

import pandas as pd

df = pd.DataFrame(
    {
        "Name": [
            "Chandler Bing",
            "Tom Hanks",
            "Will Smith",
        ],
        "Age": [22, 35, 58],
        "Gender": ["male", "male", "male"],
    }
)
print(df)

Output

            Name  Age Gender
0  Chandler Bing   22   male
1      Tom Hanks   35   male
2     Will Smith   58   male

Solution 2 – Verify if the IDE is set to use the correct Python version

If you still get the same error after installing the package, verify that the IDE you are using is configured with the correct Python interpreter.

For example, in Visual Studio Code, you can set the Python version by pressing Ctrl + Shift + P (or Cmd + Shift + P on Mac) to open the command palette.

Once the command palette opens, run the Python: Select Interpreter command and choose the correct version of Python, or the virtual environment if one is configured.
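To confirm which interpreter the IDE actually uses, you can run a short script from inside the IDE and compare its output with the interpreter where you installed pandas. This is a sketch using only the standard library; the helper name interpreter_report is hypothetical.

```python
import sys


def interpreter_report() -> dict:
    """Collect the path, version, and prefix of the running interpreter."""
    return {
        "executable": sys.executable,
        "version": "{}.{}.{}".format(*sys.version_info[:3]),
        "prefix": sys.prefix,
    }


if __name__ == "__main__":
    for key, value in interpreter_report().items():
        print(f"{key}: {value}")
```

If the executable path differs from the Python you used for pip install pandas, the IDE is pointing at a different interpreter.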

Solution 3 – Installing pandas inside the virtual environment

Many tools, such as Jupyter Notebook, Spyder, Anaconda, or PyCharm, often set up their own Python environment to keep things clean and separated from your global Python.

If you are using VS Code, then you can also create a virtual environment, as shown below.

With virtual environments, you need to ensure that the pandas module is installed inside the virtual environment and not globally.

Step 1: Create a Virtual Environment. If you have already created a virtual environment, then proceed to step 2.

Step 2: Activate the Virtual Environment

Step 3: Install the required module using the pip install command

# Create a virtual environment (Windows)
py -3 -m venv venv

# Create a virtual environment (Linux/macOS)
python3 -m venv venv

# Activate the virtual environment (Windows Command Prompt)
venv\Scripts\activate.bat

# Activate the virtual environment (Windows PowerShell)
venv\Scripts\Activate.ps1

# Activate the virtual environment (Linux/macOS)
source venv/bin/activate

# Install pandas inside the virtual environment
pip install pandas
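If you are unsure whether the virtual environment is really active when your code runs, a quick check from Python can help. The sketch below relies on sys.prefix differing from sys.base_prefix inside an environment created with venv; it is an assumption that your environment was created that way, and conda environments are not detected by this test. The function name in_virtualenv is hypothetical.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when running inside a venv-style virtual environment."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    if in_virtualenv():
        print(f"Virtual environment active: {sys.prefix}")
    else:
        print("No virtual environment detected; packages install globally")
```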

Solution 4 – Ensure that the module name is not used as a variable or file name

Last but not least, cross-check that you haven’t declared a variable with the same name as the module.

You should also check that none of your files is named pandas.py, as it may shadow the original pandas module.
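A small script can look for such files for you. The following is a sketch that checks a single directory for a pandas.py file or a pandas folder; the function name find_shadowing is hypothetical, and it assumes the shadowing file sits in the directory you pass in (typically the one containing your script).

```python
from pathlib import Path
from typing import List


def find_shadowing(directory: str, module_name: str = "pandas") -> List[Path]:
    """Return local files or folders that could shadow module_name."""
    base = Path(directory)
    # A module can be shadowed by name.py or by a folder called name
    candidates = [base / f"{module_name}.py", base / module_name]
    return [path for path in candidates if path.exists()]


if __name__ == "__main__":
    for path in find_shadowing("."):
        print(f"Possible shadowing: {path}")
```

Renaming any file or folder it reports is usually enough to let Python find the installed package again.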

If the issue is still not solved, try uninstalling the package and installing it again, restart the IDE, and check the paths to ensure that the package is installed in the correct environment and for the correct Python version.

Conclusion

The ModuleNotFoundError: No module named ‘pandas’ error occurs when you try to import the pandas module without having installed the package, or when the package is installed in a different environment from the one running your code.

You can resolve the issue by installing the pandas module with the pip install pandas command. Also, ensure that the module is installed in the proper environment if you use virtual environments, and that the Python version is set appropriately in the IDE in which you run the code.