sql-fundamentals

SQL Fundamentals Core Knowledge

Deep Knowledge: Use mcp__documentation__fetch_docs with technology: sql for comprehensive documentation.

SQL Statement Categories

Category Statements Purpose

DML SELECT, INSERT, UPDATE, DELETE, MERGE Data manipulation

DDL CREATE, ALTER, DROP, TRUNCATE Schema definition

DCL GRANT, REVOKE Access control

TCL BEGIN, COMMIT, ROLLBACK, SAVEPOINT Transaction control

SELECT Statement

SELECT [DISTINCT] columns FROM table [JOIN other_table ON condition] [WHERE condition] [GROUP BY columns] [HAVING condition] [ORDER BY columns [ASC|DESC]] [LIMIT n OFFSET m];

Execution Order

FROM (and JOINs)
WHERE
GROUP BY
HAVING
SELECT
DISTINCT
ORDER BY
LIMIT/OFFSET

INSERT Patterns

-- Single row INSERT INTO users (name, email) VALUES ('John', 'john@example.com');

-- Multiple rows INSERT INTO users (name, email) VALUES ('John', 'john@example.com'), ('Jane', 'jane@example.com');

-- Insert from SELECT INSERT INTO users_backup (name, email) SELECT name, email FROM users WHERE created_at < '2024-01-01';

-- Insert with RETURNING (PostgreSQL) INSERT INTO users (name, email) VALUES ('John', 'john@example.com') RETURNING id, created_at;

UPDATE Patterns

-- Simple update UPDATE users SET name = 'John Doe' WHERE id = 1;

-- Multiple columns UPDATE users SET name = 'John', status = 'active' WHERE id = 1;

-- Update with subquery UPDATE orders SET status = 'shipped' WHERE user_id IN (SELECT id FROM users WHERE is_premium = true);

-- Update with JOIN (varies by database) -- PostgreSQL UPDATE orders o SET status = 'vip' FROM users u WHERE o.user_id = u.id AND u.is_premium = true;

DELETE Patterns

-- Delete with condition DELETE FROM users WHERE id = 1;

-- Delete with subquery DELETE FROM orders WHERE user_id IN ( SELECT id FROM users WHERE status = 'deleted' );

-- Soft delete pattern (prefer this) UPDATE users SET deleted_at = NOW() WHERE id = 1;

JOIN Types

Join Type Returns

INNER JOIN

Only matching rows from both tables

LEFT JOIN

All left + matching right (NULL if no match)

RIGHT JOIN

All right + matching left (NULL if no match)

FULL OUTER JOIN

All rows from both tables

CROSS JOIN

Cartesian product (all combinations)

-- INNER JOIN SELECT u.name, o.total FROM users u INNER JOIN orders o ON o.user_id = u.id;

-- LEFT JOIN (include users without orders) SELECT u.name, COALESCE(o.total, 0) as total FROM users u LEFT JOIN orders o ON o.user_id = u.id;

-- Self JOIN (hierarchical data) SELECT e.name as employee, m.name as manager FROM employees e LEFT JOIN employees m ON e.manager_id = m.id;

Aggregations

-- Basic aggregates SELECT COUNT(*) as total, COUNT(DISTINCT user_id) as unique_users, SUM(amount) as total_amount, AVG(amount) as avg_amount, MIN(amount) as min_amount, MAX(amount) as max_amount FROM orders;

-- GROUP BY SELECT user_id, COUNT(*) as order_count, SUM(amount) as total FROM orders GROUP BY user_id;

-- HAVING (filter after GROUP BY) SELECT user_id, SUM(amount) as total FROM orders GROUP BY user_id HAVING SUM(amount) > 1000;

DDL - Table Definition

CREATE TABLE users ( id SERIAL PRIMARY KEY, -- PostgreSQL auto-increment -- id INT AUTO_INCREMENT PRIMARY KEY -- MySQL -- id INT IDENTITY(1,1) PRIMARY KEY -- SQL Server email VARCHAR(255) NOT NULL UNIQUE, name VARCHAR(100) NOT NULL, status VARCHAR(20) DEFAULT 'active', created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP, updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,

CONSTRAINT chk_status CHECK (status IN ('active', 'inactive', 'deleted'))

);

-- Foreign key CREATE TABLE orders ( id SERIAL PRIMARY KEY, user_id INT NOT NULL, total DECIMAL(10, 2) NOT NULL,

CONSTRAINT fk_orders_user
    FOREIGN KEY (user_id) REFERENCES users(id)
    ON DELETE CASCADE
    ON UPDATE CASCADE

);

ALTER TABLE

-- Add column ALTER TABLE users ADD COLUMN phone VARCHAR(20);

-- Drop column ALTER TABLE users DROP COLUMN phone;

-- Modify column ALTER TABLE users ALTER COLUMN name TYPE VARCHAR(200);

-- Add constraint ALTER TABLE users ADD CONSTRAINT uq_phone UNIQUE (phone);

-- Drop constraint ALTER TABLE users DROP CONSTRAINT uq_phone;

-- Rename column ALTER TABLE users RENAME COLUMN name TO full_name;

-- Rename table ALTER TABLE users RENAME TO customers;

Indexes

-- Create index CREATE INDEX idx_users_email ON users(email);

-- Unique index CREATE UNIQUE INDEX idx_users_email ON users(email);

-- Composite index CREATE INDEX idx_orders_user_date ON orders(user_id, created_at DESC);

-- Partial index (PostgreSQL) CREATE INDEX idx_active_users ON users(email) WHERE status = 'active';

-- Drop index DROP INDEX idx_users_email;

Transactions

-- Basic transaction BEGIN; UPDATE accounts SET balance = balance - 100 WHERE id = 1; UPDATE accounts SET balance = balance + 100 WHERE id = 2; COMMIT;

-- With savepoint BEGIN; UPDATE accounts SET balance = balance - 100 WHERE id = 1; SAVEPOINT after_debit;

UPDATE accounts SET balance = balance + 100 WHERE id = 2;
-- Something went wrong with the credit
ROLLBACK TO after_debit;

-- Try different approach
UPDATE accounts SET balance = balance + 100 WHERE id = 3;

COMMIT;

-- Rollback on error BEGIN; -- operations... ROLLBACK; -- cancel all changes

Isolation Levels

Level Dirty Read Non-Repeatable Read Phantom Read

READ UNCOMMITTED Yes Yes Yes

READ COMMITTED No Yes Yes

REPEATABLE READ No No Yes

SERIALIZABLE No No No

-- Set isolation level SET TRANSACTION ISOLATION LEVEL SERIALIZABLE; BEGIN; -- operations COMMIT;

NULL Handling

-- Check for NULL SELECT * FROM users WHERE phone IS NULL; SELECT * FROM users WHERE phone IS NOT NULL;

-- COALESCE (first non-null) SELECT COALESCE(phone, 'N/A') as phone FROM users;

-- NULLIF (return NULL if equal) SELECT NULLIF(status, 'unknown') FROM users;

-- NULL in aggregates (ignored except COUNT()) SELECT AVG(score) FROM tests; -- NULLs ignored SELECT COUNT() FROM tests; -- counts all rows SELECT COUNT(score) FROM tests; -- counts non-NULL only

Subqueries

-- Scalar subquery SELECT name, (SELECT COUNT(*) FROM orders WHERE user_id = users.id) as order_count FROM users;

-- IN subquery SELECT * FROM users WHERE id IN ( SELECT DISTINCT user_id FROM orders WHERE total > 100 );

-- EXISTS subquery (often faster than IN) SELECT * FROM users u WHERE EXISTS ( SELECT 1 FROM orders o WHERE o.user_id = u.id AND o.total > 100 );

-- Correlated subquery SELECT * FROM orders o1 WHERE total > ( SELECT AVG(total) FROM orders o2 WHERE o2.user_id = o1.user_id );

Best Practices

Use parameterized queries (prevent SQL injection)
Add indexes on WHERE/JOIN columns
Use appropriate data types
Define foreign keys for data integrity
Use transactions for multiple related operations
Use EXPLAIN to analyze query performance

DON'T

Use SELECT * in production
UPDATE/DELETE without WHERE clause
Store comma-separated values in columns
Use reserved words as identifiers
Ignore NULL handling

When NOT to Use This Skill

Advanced SQL (CTEs, window functions, recursive queries) - Use sql-advanced skill
PostgreSQL specifics (arrays, JSONB, extensions) - Use postgresql skill
MySQL specifics (engine selection, stored procedures) - Use mysql skill
Document databases - Use mongodb for document-oriented data
Caching - Use redis for caching needs

Anti-Patterns

Anti-Pattern Problem Solution

SELECT * in production Transfers unnecessary data Specify only needed columns

No WHERE on UPDATE/DELETE Unintended changes to all rows Always add WHERE clause

Missing indexes on JOIN columns Slow queries, full table scans Add indexes on foreign keys

String concatenation in SQL SQL injection vulnerability Use parameterized queries

Implicit data type conversions Performance loss, unexpected results Use explicit CAST

Storing CSV in columns Violates 1NF, hard to query Normalize into separate table

Using reserved words as identifiers Syntax errors, portability issues Choose different names

Quick Troubleshooting

Problem Diagnostic Fix

Syntax errors Check SQL dialect Use correct syntax for your database

Slow queries EXPLAIN or EXPLAIN ANALYZE

Add indexes, rewrite query

Deadlocks Check transaction logs Reduce transaction scope, consistent ordering

Foreign key violation Check referenced table data Insert parent record first

Duplicate key error Check UNIQUE constraints Use UPSERT or handle conflict

NULL comparison fails Remember NULL != NULL Use IS NULL, IS NOT NULL

Reference Documentation

DML Patterns
DDL Patterns
JOIN Patterns
Aggregations & Window Functions
Transactions

sql-fundamentals

Safety Notice

Copy this and send it to your AI assistant to learn

Source Transparency

Related Skills

cron-scheduling

token-optimization

react-19