pythonintermediate
Pandera DataFrame Schema Validation
Use Pandera to validate DataFrame schemas with type checks, value constraints, and custom checks.
pythonPress ⌘/Ctrl + Shift + C to copy
import pandera as pa
import pandas as pd
schema = pa.DataFrameSchema(
{
'id': pa.Column(int, pa.Check.greater_than(0)),
'name': pa.Column(str, pa.Check.str_length(1, 100)),
'score': pa.Column(float, pa.Check.in_range(0.0, 100.0)),
'tier': pa.Column(str, pa.Check.isin(['bronze', 'silver', 'gold'])),
},
)
df = pd.DataFrame({'id':[1,2],'name':['Alice','Bob'],'score':[92.5,78.0],'tier':['gold','silver']})
validated = schema.validate(df)
print(validated)Use Cases
- pipeline input validation
- data contracts
- CI quality gates
Tags
Related Snippets
Similar patterns you can reuse in the same workflow.
pythonintermediate
Data Validation with Pydantic
Validate and parse data records using Pydantic models with custom validators and error reporting.
Best for: Validating incoming data before warehouse loading
#validation#pydantic
pythonintermediate
Data Quality Testing with Expectations
Define and run data quality expectations for automated validation in data pipelines.
Best for: Automated data quality gates in pipelines
#data-quality#testing
pythonadvanced
Great Expectations Data Quality Suite
Define and run a Great Expectations validation suite to catch data quality issues early.
Best for: CI data validation
#great-expectations#data-quality
pythonintermediate
Pydantic Models for ETL Validation
Parse and validate raw JSON records against Pydantic models before inserting into a database.
Best for: input validation
#pydantic#validation