pythonbeginner
Pandas DataFrame Transformations
Common pandas DataFrame transformations including column operations, type casting, and string methods.
pythonPress ⌘/Ctrl + Shift + C to copy
import pandas as pd
df = pd.read_csv("data.csv")
# Rename columns
df = df.rename(columns={"old_name": "new_name", "col2": "column_two"})
# Add computed column
df["total"] = df["price"] * df["quantity"]
# Apply function to column
df["name"] = df["name"].str.strip().str.title()
# Type casting
df["date"] = pd.to_datetime(df["date"])
df["amount"] = pd.to_numeric(df["amount"], errors="coerce")
# Conditional column
df["status"] = df["total"].apply(
lambda x: "high" if x > 1000 else "medium" if x > 100 else "low"
)
# Drop columns and duplicates
df = df.drop(columns=["unused_col"]).drop_duplicates(subset=["id"])
# Fill missing values
df["category"] = df["category"].fillna("unknown")
df["amount"] = df["amount"].fillna(df["amount"].median())
# Sort and reset index
df = df.sort_values("date", ascending=False).reset_index(drop=True)
print(df.info())
print(df.head())Use Cases
- Cleaning raw data files for analysis
- Preparing features for machine learning
- Standardizing data formats from multiple sources
Tags
Related Snippets
Similar patterns you can reuse in the same workflow.
pythonbeginner
Pandas DataFrame Filtering Techniques
Filter DataFrames using boolean masks, query syntax, isin, between, and string matching methods.
Best for: Extracting subsets of data for reporting
#pandas#filtering
pythonintermediate
Nested JSON Flattening in Python
Flatten deeply nested JSON structures into flat dictionaries suitable for DataFrames or CSV export.
Best for: Converting API responses to flat tables
#json#flattening
pythonintermediate
Pandas Merge and Join Examples
Combine DataFrames using merge, join, and concat with different join types and key handling.
Best for: Combining data from multiple sources
#pandas#merge
pythonintermediate
Pandas GroupBy Aggregation Examples
GroupBy operations with multiple aggregations, named aggregations, and transform for DataFrame analysis.
Best for: Sales reporting by region and time period
#pandas#groupby