Building with SQLAlchemy ORM

Build production-grade database applications with SQLAlchemy ORM 2.0+, generic PostgreSQL patterns, and Neon-specific serverless considerations.

Before Implementation

Gather context to ensure successful implementation:

Source Gather

Codebase Existing models, database setup, connection patterns

Conversation Student's specific use case (what they're building), constraints

Skill References Domain patterns from references/ (API docs, best practices, architecture)

User Guidelines Project conventions, proficiency level

Only ask student for THEIR requirements (domain expertise is embedded in this skill).

Persona

You are a Python database architect with production experience building applications with SQLAlchemy ORM. You understand both the generic PostgreSQL patterns (applicable everywhere) and Neon-specific serverless considerations (autoscaling, scale-to-zero, branching). You've built multi-table applications with proper transaction handling, relationships, and connection pooling.

When to Use

Building database models from requirements (defining tables as Python classes)
Implementing CRUD operations safely with transactions
Managing relationships between tables (foreign keys, joins)
Querying data with filters, ordering, and complex joins
Connecting to PostgreSQL or Neon with proper configuration
Teaching database fundamentals to beginners learning persistence

Core Concepts

Models as Classes (ORM Abstraction)

SQLAlchemy maps Python classes to database tables:

from sqlalchemy import Column, Integer, String, Float, ForeignKey from sqlalchemy.orm import declarative_base

Base = declarative_base()

class Expense(Base): tablename = 'expenses'

id = Column(Integer, primary_key=True)
description = Column(String(200))
amount = Column(Float)
category_id = Column(Integer, ForeignKey('categories.id'))

Why this matters: You write Python. SQLAlchemy generates SQL. You don't write SQL by hand.

Sessions as Transactions (Unit of Work)

A session groups database operations into an atomic transaction:

with Session(engine) as session: new_expense = Expense(description="Groceries", amount=45.50, category_id=1) session.add(new_expense) session.commit() # All or nothing

Why this matters: If anything fails, nothing is committed. Guarantees database consistency.

Relationships (Foreign Keys as Navigation)

Define relationships in Python instead of manual joins:

class Category(Base): tablename = 'categories' id = Column(Integer, primary_key=True) name = Column(String(50)) expenses = relationship("Expense", back_populates="category")

class Expense(Base): tablename = 'expenses' id = Column(Integer, primary_key=True) category = relationship("Category", back_populates="expenses")

Usage:

category = session.query(Category).first() print(category.expenses) # All expenses in this category

Queries (Filtering, Ordering, Joining)

Construct queries safely without writing raw SQL:

Filter: expenses > $50

expensive = session.query(Expense).filter(Expense.amount > 50).all()

Order: sorted by amount descending

sorted_expenses = session.query(Expense).order_by(Expense.amount.desc()).all()

Join: expenses with their categories

results = session.query(Expense, Category).join(Category).all()

Neon Connection Specifics

Neon is serverless PostgreSQL with auto-scaling and branching. Key differences:

Connection string: postgresql+psycopg2://user:pass@host/dbname?sslmode=require
Always use SSL: ?sslmode=require (Neon enforces this)
Environment variables: Store credentials in .env (never hardcode)
Auto-pause: Neon pauses compute when idle—connection pools help with this

Decision Logic

Scenario Pattern Why

First database model Single table, one Column type Simplest mental model before relationships

Need to link data Use relationship() + ForeignKey ORM handles complex joins for you

Many concurrent requests Connection pooling with pool_size Neon scales compute; pooling maximizes it

Data consistency critical Transactions with try/except Rollback on error; guarantees atomicity

Want to scale to zero Neon serverless + pool with echo_pool Auto-pause when idle; wake on first request

Debugging queries Enable echo=True in engine See generated SQL

Workflow: Building Budget Tracker

Step 1: Define Models

from sqlalchemy import create_engine, Column, Integer, String, Float, DateTime, ForeignKey, func from sqlalchemy.orm import declarative_base, relationship, Session from datetime import datetime

Base = declarative_base()

class Category(Base): tablename = 'categories' id = Column(Integer, primary_key=True) name = Column(String(50), unique=True) expenses = relationship("Expense", back_populates="category")

class Expense(Base): tablename = 'expenses' id = Column(Integer, primary_key=True) description = Column(String(200)) amount = Column(Float) date = Column(DateTime, default=datetime.utcnow) category_id = Column(Integer, ForeignKey('categories.id')) category = relationship("Category", back_populates="expenses")

Step 2: Create Engine and Tables

import os from dotenv import load_dotenv

load_dotenv()

Connection string from environment

DATABASE_URL = os.getenv("DATABASE_URL")

Format: postgresql+psycopg2://user:password@host/dbname?sslmode=require

engine = create_engine(DATABASE_URL) Base.metadata.create_all(engine)

Step 3: Implement CRUD

def create_expense(session, description, amount, category_id): """Create a new expense.""" try: expense = Expense( description=description, amount=amount, category_id=category_id ) session.add(expense) session.commit() return expense except Exception as e: session.rollback() print(f"Error creating expense: {e}") return None

def read_expenses(session, category_id=None): """Read expenses, optionally filtered by category.""" query = session.query(Expense) if category_id: query = query.filter(Expense.category_id == category_id) return query.all()

def update_expense(session, expense_id, amount=None, description=None): """Update an expense.""" expense = session.query(Expense).filter(Expense.id == expense_id).first() if expense: if amount is not None: expense.amount = amount if description is not None: expense.description = description session.commit() return expense return None

def delete_expense(session, expense_id): """Delete an expense.""" expense = session.query(Expense).filter(Expense.id == expense_id).first() if expense: session.delete(expense) session.commit() return True return False

Step 4: Query with Relationships

Get all expenses for a category

category = session.query(Category).filter_by(name="Food").first() print(category.expenses) # Uses relationship

Total spent by category

totals = session.query( Category.name, func.sum(Expense.amount).label('total') ).join(Expense).group_by(Category.name).all()

for name, total in totals: print(f"{name}: ${total:.2f}")

Step 5: Handle Transactions Safely

def transfer_expense(session, expense_id, new_category_id): """Move expense to different category (must succeed fully or not at all).""" try: expense = session.query(Expense).filter(Expense.id == expense_id).first() if not expense: raise ValueError(f"Expense {expense_id} not found")

    expense.category_id = new_category_id
    session.commit()
    return True
except Exception as e:
    session.rollback()
    print(f"Transaction failed, rolled back: {e}")
    return False

Step 6: Connect to Neon

Environment file (.env):

DATABASE_URL=postgresql+psycopg2://user:password@ep-ABC123.neon.tech/dbname?sslmode=require

Connection with pool configuration:

from sqlalchemy.pool import QueuePool

engine = create_engine( DATABASE_URL, poolclass=QueuePool, pool_size=5, max_overflow=10, pool_pre_ping=True, # Verify connections before use echo=False # Set to True for debugging )

MCP Integration

To connect SQLAlchemy database operations to AI agents:

Define an MCP tool that the agent can use

def query_expenses_by_category(category_name: str) -> list: """Agent can ask: 'How much did I spend on groceries?'""" with Session(engine) as session: return session.query(Expense).join(Category).filter( Category.name == category_name ).all()

def summarize_spending(start_date, end_date) -> dict: """Agent can generate reports.""" with Session(engine) as session: return session.query( Category.name, func.sum(Expense.amount).label('total'), func.count(Expense.id).label('count') ).join(Expense).filter( Expense.date.between(start_date, end_date) ).group_by(Category.name).all()

Safety & Guardrails

NEVER

❌ Hardcode credentials in Python files
❌ Skip error handling around transactions
❌ Trust user input without validation
❌ Commit secrets to git
❌ Skip connection pooling for production

ALWAYS

✅ Use environment variables for connection strings (.env file)
✅ Wrap transactions in try/except blocks with rollback
✅ Validate and sanitize all user input before database operations
✅ Use session.commit() explicitly (never auto-commit)
✅ Use session.rollback() on errors
✅ Enable pool_pre_ping=True to check connection health
✅ Use ?sslmode=require with Neon (enforced anyway)

Common Mistakes

Mistake Impact Fix

Forgetting session.commit()

Changes not saved Always call commit() or use context manager

Not rolling back on error Partial data in database Wrap in try/except with rollback()

Hardcoding credentials Security breach Use environment variables

No connection pooling Neon compute scaling inefficient Set pool_size parameter

Raw user input in queries SQL injection Use parameterized queries (ORM does this)

Budget Tracker Example (Complete)

See references/budget-tracker-complete.py for a fully working Budget Tracker application with:

Model definitions
Database setup
CRUD functions
Transaction handling
Neon connection
Example usage

Troubleshooting

Problem Cause Solution

ModuleNotFoundError: No module named 'sqlalchemy'

Not installed pip install sqlalchemy or uv add sqlalchemy

ModuleNotFoundError: No module named 'psycopg2'

PostgreSQL driver missing pip install psycopg2-binary or uv add psycopg2-binary

OperationalError: could not connect to server

Wrong connection string or Neon offline Check DATABASE_URL format, verify Neon project is running

IntegrityError: duplicate key value

Inserting duplicate unique field Check if value already exists, use update instead

ForeignKeyError: could not create foreign key

Category doesn't exist Create category first, or use valid category_id

Queries are slow No indexes, missing relationships Check references/architecture.md for indexing patterns

Resources

Official Docs: https://docs.sqlalchemy.org/en/20/orm/quickstart.html
Neon Docs: https://neon.com/docs/
PostgreSQL Types: https://www.postgresql.org/docs/current/datatype.html

building-with-sqlalchemy-orm

Safety Notice

Copy this and send it to your AI assistant to learn

Filter: expenses > $50

Order: sorted by amount descending

Join: expenses with their categories

Connection string from environment

Format: postgresql+psycopg2://user:password@host/dbname?sslmode=require

Get all expenses for a category

Total spent by category

Define an MCP tool that the agent can use

Source Transparency

Related Skills

pptx

quiz-generator

assessment-architect

ai-collaborate-teaching