The Pandas DataFrame Explained

Welcome to **Day 103**. Today we look at the atom of Data Science: the **DataFrame**.

What is a DataFrame?

Think of a DataFrame as a spreadsheet that lives in your computer's RAM. It has rows, columns, and an **Index**.

Key Components

1. **Series**: A single column. A DataFrame is essentially a collection of Series that share the same index.

2. **Index**: The "ID" of each row. By default, it's 0, 1, 2..., but it can be dates or IDs.

3. **Columns**: The headers of your data.

Useful Inspection Commands

When you load a new dataset, run these immediately:

`df.info()`: Shows data types and if there are NULLs.

`df.describe()`: Shows statistical summaries (Mean, Max, Min).

`df.shape`: Tells you exactly many rows and columns شما have.

import pandas as pd

df = pd.read_csv('orders.csv')

print(df.info())

print(df.describe())

The Mental Shift

In SQL, you think about the table as a persistent file in a database. In Python, the DataFrame is a **Variable**. You can transform it, copy it, and delete it in an instant.

Your Task for Today

Create a small dictionary in Python and convert it into a Pandas DataFrame using `pd.DataFrame(data)`.

*Day 104: Loading Data from SQL to Python.*

What is a DataFrame?

Key Components

Useful Inspection Commands

The Mental Shift

Your Task for Today

Ready to put your knowledge into practice?