The Pandas DataFrame Explained
Welcome to **Day 103**. Today we look at the atom of Data Science: the **DataFrame**.
What is a DataFrame?
Think of a DataFrame as a spreadsheet that lives in your computer's RAM. It has rows, columns, and an **Index**.
Key Components
1. **Series**: A single column. A DataFrame is essentially a collection of Series that share the same index.
2. **Index**: The "ID" of each row. By default, it's 0, 1, 2..., but it can be dates or IDs.
3. **Columns**: The headers of your data.
Useful Inspection Commands
When you load a new dataset, run these immediately:
import pandas as pd
df = pd.read_csv('orders.csv')
print(df.info())
print(df.describe())
The Mental Shift
In SQL, you think about the table as a persistent file in a database. In Python, the DataFrame is a **Variable**. You can transform it, copy it, and delete it in an instant.
Your Task for Today
Create a small dictionary in Python and convert it into a Pandas DataFrame using `pd.DataFrame(data)`.
*Day 104: Loading Data from SQL to Python.*