Saturday, September 21, 2024

Import a dataset into a Jupyter Notebook

To import a dataset into a Jupyter Notebook, you can follow these general steps, depending on the format of your dataset (CSV, Excel, JSON, etc.). Here are examples for some common formats:

1. CSV Files

If your dataset is in CSV format, you can use the pandas library:


2. Excel Files

For Excel files, you'll also use pandas, but make sure you have openpyxl or xlrd installed for .xlsx and .xls files, respectively:



3. JSON Files

If your dataset is in JSON format:



4. SQLite Database

If your dataset is in an SQLite database:


5. Loading Data from URLs

You can also load datasets directly from a URL:


To read a CSV file from a directory in a Jupyter Notebook using pandas, follow these steps:

1. Import the Pandas Library

Make sure you have pandas installed. If not, you can install it via pip:


pip install pandas

2. Use the Correct File Path

You'll need to specify the path to your CSV file. Here’s how to do it:


import pandas as pd # Replace 'path/to/your/directory/your_dataset.csv' with path to your CSV file file_path = 'path/to/your/directory/your_dataset.csv' # Read the CSV file df = pd.read_csv(file_path) # Display the first few rows of the dataframe print(df.head())

Example

Assuming your CSV file is in a folder named "data" within the current directory:


import pandas as pd # Specify the path to your CSV file file_path = './data/your_dataset.csv' # Read the CSV file df = pd.read_csv(file_path) # Display the first few rows of the dataframe print(df.head())


No comments:

Post a Comment