fMRI Preprocessing Pipeline Guide
Purpose
fMRI preprocessing transforms raw scanner data into a form suitable for statistical analysis. Unlike generic data cleaning, every preprocessing decision in fMRI involves domain-specific trade-offs: choosing the wrong step order can introduce artifacts that mimic neural signal, smoothing at the wrong scale destroys the spatial information needed for multivariate analyses, and failing to correct for susceptibility distortions misaligns brain regions by several millimeters.
A competent programmer without neuroimaging training would get many of these decisions wrong. This skill encodes the domain knowledge required to make correct preprocessing choices for different analysis goals.
When to Use This Skill
- Setting up a preprocessing pipeline for task fMRI, resting-state fMRI, or MVPA
- Choosing between preprocessing tools (fMRIPrep, FSL, SPM, AFNI)
- Deciding which steps to include, skip, or modify for a specific analysis type
- Performing quality control on preprocessed data
- Reviewing or troubleshooting an existing preprocessing pipeline
- Selecting parameters for motion correction, smoothing, or normalization
Research Planning Protocol
Before executing the domain-specific steps below, you MUST:
- State the research question — What analysis will follow preprocessing and what does it require?
- Justify the preprocessing choices — Why these steps in this order? What alternatives were considered?
- Declare expected quality metrics — What motion thresholds, SNR values, and exclusion criteria will you use?
- Note assumptions and limitations — What does this pipeline assume about the data? Where could it mislead?
- Present the plan to the user and WAIT for confirmation before proceeding.
For detailed methodology guidance, see the research-literacy skill.
⚠️ Verification Notice
This skill was generated by AI from academic literature. All parameters, thresholds, and citations require independent verification before use in research. If you find errors, please open an issue.
Pipeline Order Decision Tree
The standard preprocessing order is well-established, but branch points exist depending on the analysis type. The canonical order is:
1. DICOM to NIfTI conversion (BIDS format)
2. Discard non-steady-state volumes
3. Slice timing correction (conditional)
4. Motion correction (rigid body)
5. Distortion correction (fieldmap-based)
6. Coregistration (functional to structural)
7. Spatial normalization (to MNI/template space)
8. Smoothing (conditional on analysis type)
Decision points:
Is your TR < 1 second (e.g., multiband)?
|
+-- YES --> Skip slice timing correction; use temporal derivative in GLM instead
| (Sladky et al., 2011; HCP consortium recommendation)
|
+-- NO --> Apply slice timing correction before motion correction
(AFNI and SPM convention; Sladky et al., 2011)
What is your analysis type?
|
+-- Task activation (univariate GLM)
| --> Full pipeline with smoothing (FWHM = 2-3x voxel size)
|
+-- Resting-state connectivity
| --> Full pipeline + aggressive confound regression; smoothing 4-6 mm
|
+-- MVPA / multivariate decoding
--> Full pipeline WITHOUT smoothing (or minimal, <= 2 mm)
Smooth only after first-level pattern extraction
Core Preprocessing Steps
Step 1: DICOM to NIfTI Conversion
Convert raw DICOM files to NIfTI format organized in BIDS (Brain Imaging Data Structure; Gorgolewski et al., 2016). BIDS standardizes file naming and metadata, enabling automated pipeline tools like fMRIPrep to detect acquisition parameters automatically.
Tools: dcm2niix (Li et al., 2016), heudiconv, BIDScoin
Step 2: Discard Non-Steady-State Volumes
The first few volumes of an fMRI run are acquired before the MR signal reaches T1 equilibrium, producing artificially high signal intensity. Discard the first 3-5 volumes (approximately 5-10 seconds) unless the scanner automatically acquired dummy scans (Poldrack et al., 2011, Ch. 5).
Domain insight: Modern scanners often acquire dummy scans that are not saved. Check the acquisition protocol. fMRIPrep detects non-steady-state volumes automatically using signal intensity changes.
Step 3: Slice Timing Correction
Within each TR, slices are acquired sequentially (not simultaneously), creating temporal offsets of up to one full TR between first and last slices. Slice timing correction (STC) interpolates each slice to a common time point (Sladky et al., 2011).
| Condition | Recommendation | Rationale |
|---|---|---|
| TR > 2 s, interleaved acquisition | Always apply STC | Temporal offset is large; significant benefit (Sladky et al., 2011) |
| TR 1-2 s | Apply STC; moderate benefit | Still corrects meaningful timing offsets |
| TR < 1 s (multiband) | Skip STC; use temporal derivative in GLM | Minimal offset; correction provides negligible benefit (HCP consortium) |
| Using dynamic causal modeling (DCM) | Mandatory STC | DCM requires precise timing alignment across regions |
Order debate: STC before motion correction is standard in AFNI and SPM. FSL applies STC after motion correction. Both are acceptable; the optimal order depends on the level of motion and slice acquisition pattern (Parker & Razlighi, 2019). Ideally, joint correction would be applied (Roche, 2011), but this is not yet standard in major packages.
Step 4: Motion Correction
Align all volumes to a reference volume using 6-parameter rigid body (3 translation, 3 rotation) transformation (Jenkinson et al., 2002). This is the single most critical preprocessing step.
| Parameter | Recommendation | Source |
|---|---|---|
| Degrees of freedom | 6 (rigid body) | Jenkinson et al., 2002 |
| Reference volume | Mean image or middle volume | Jenkinson et al., 2002 |
| Cost function | Normalized correlation | Jenkinson et al., 2002 (MCFLIRT default) |
| Interpolation | Trilinear (during estimation); sinc or spline (final reslice) | Poldrack et al., 2011, Ch. 5 |
Domain insight: Motion correction is inherently imperfect because each slice within a volume was acquired at a different time. When the head moves during a TR, each slice has a slightly different rigid-body transformation, but whole-volume correction applies a single transformation. This is an unavoidable limitation (Jenkinson et al., 2002).
Output: Save the 6 motion parameters for use as confound regressors in the GLM (see fmri-glm-analysis-guide).
Step 5: Distortion Correction
EPI images suffer geometric distortions along the phase-encoding direction due to B0 field inhomogeneities. Distortions are worst near air-tissue boundaries: orbitofrontal cortex, anterior temporal lobes, and inferior temporal regions (Jezzard & Balaban, 1995).
| Method | Data Required | Tool | Source |
|---|---|---|---|
| Fieldmap-based (FUGUE) | Dual-echo gradient echo fieldmap | FSL FUGUE | Jezzard & Balaban, 1995 |
| Reverse phase-encoding (TOPUP) | Opposite-PE EPI pair (AP/PA) | FSL TOPUP | Andersson et al., 2003 |
| SyN-based (fieldmapless) | T1-weighted image only | ANTs SyN-SDC | fMRIPrep fallback |
Domain warning: If no fieldmap data were acquired, fMRIPrep can perform fieldmapless distortion correction using nonlinear registration to the T1, but this is less accurate than fieldmap-based methods. Always acquire fieldmap data when possible.
Step 6: Coregistration
Align the functional (EPI) image to the subject's structural (T1-weighted) image. This enables projecting functional results onto anatomical space and provides the bridge to template normalization.
- Method: Boundary-based registration (BBR; Greve & Fischl, 2009) is preferred over intensity-based methods for EPI-to-T1 alignment because it uses white matter boundaries, which are well-defined in both modalities
- Always visually inspect the coregistration overlay (functional edges on structural image)
Step 7: Spatial Normalization
Warp each subject's brain to a standard template space to enable group-level comparisons.
| Parameter | Recommendation | Source |
|---|---|---|
| Template | MNI152NLin2009cAsym (fMRIPrep default) or MNI152NLin6Asym | Fonov et al., 2011 |
| Method | Nonlinear (ANTs SyN or SPM Unified Segmentation) | Ashburner & Friston, 2005; Avants et al., 2008 |
| Output resolution | 2 mm isotropic (standard) | Convention; matches MNI template resolution |
| For high-resolution data | Match native resolution (e.g., 1.5 mm) | Preserve spatial detail |
Domain warning: Always visually inspect normalization quality. Check that major sulci (central sulcus, Sylvian fissure) and subcortical structures (caudate, putamen) align with the template. Poor normalization is a common but silent source of error in group analyses.
Step 8: Spatial Smoothing
Smoothing with a Gaussian kernel increases SNR, satisfies the smoothness assumptions of Random Field Theory, and reduces inter-subject anatomical variability (Mikl et al., 2008).
| Analysis Type | FWHM Recommendation | Rationale | Source |
|---|---|---|---|
| Univariate (task GLM) | 2-3x voxel size (e.g., 6-8 mm for 2-3 mm voxels) | Matches expected activation extent; maximizes sensitivity | Mikl et al., 2008; Poldrack et al., 2011 |
| Resting-state connectivity | 4-6 mm | Moderate smoothing; balance noise reduction and spatial specificity | Ciric et al., 2017 |
| MVPA / decoding | None or <= 2 mm | Smoothing destroys fine-grained spatial patterns essential for decoding | Misaki et al., 2013 |
| Searchlight analysis | None | Searchlight already averages within the sphere | Etzel et al., 2013 |
Domain warning: Smoothing before MVPA is one of the most common preprocessing errors. For multivariate analyses, skip smoothing during preprocessing entirely. If group-level smoothing is needed, apply it only after the first-level pattern analysis is complete (Misaki et al., 2013).
Step 9: Confound Regression
Confound regression removes variance from non-neural sources. This step is typically performed during the statistical model (GLM) rather than as a separate preprocessing step, but the preprocessing pipeline must output the confound time series.
For detailed confound regression guidance, see the fmri-glm-analysis-guide skill.
Key confounds to extract during preprocessing:
- 6 motion parameters (from Step 4) and their temporal derivatives and squared terms (24-parameter model; Friston et al., 1996)
- Framewise displacement (FD; Power et al., 2012)
- DVARS (Power et al., 2012)
- aCompCor components from white matter and CSF (Behzadi et al., 2007)
- Global signal (optional; controversial for connectivity analyses; Murphy & Fox, 2017)
Pipeline Variants by Analysis Type
| Step | Task Activation | Resting-State Connectivity | MVPA |
|---|---|---|---|
| Non-steady-state removal | Yes | Yes | Yes |
| Slice timing correction | Yes (if TR > 1 s) | Yes (if TR > 1 s) | Yes (if TR > 1 s) |
| Motion correction | Yes | Yes | Yes |
| Distortion correction | Yes | Yes | Yes |
| Coregistration | Yes | Yes | Yes |
| Normalization | Yes (2 mm) | Yes (2 mm) | Optional; can stay in native space |
| Smoothing | 6-8 mm FWHM | 4-6 mm FWHM | None |
| High-pass filter | 128 s (in GLM) | 0.01 Hz (in preprocessing) | 128 s (in GLM) |
| Band-pass filter | No | 0.01-0.1 Hz (controversial) | No |
| Motion threshold (FD) | 0.5 mm (spike regress) | 0.2 mm (scrub or regress) | 0.5 mm |
| Confound model | 24-param + aCompCor | 36-param or aCompCor + GSR | Minimal (6-param motion) |
fMRIPrep as the Recommended Standard
fMRIPrep (Esteban et al., 2019) is recommended as the default preprocessing tool for most fMRI studies. It provides:
- Automated, analysis-agnostic preprocessing that adapts to the specific dataset
- Transparent, reproducible workflow with detailed visual reports for QC
- Best-in-breed algorithms: ANTs for normalization, FreeSurfer for surface reconstruction, MCFLIRT/ANTs for motion correction
- BIDS-compatible input and output
- Comprehensive confound time series output (motion, CompCor, FD, DVARS)
What fMRIPrep does NOT do:
- Smoothing (intentionally left to the user because the optimal kernel depends on analysis type)
- Temporal filtering (left to the GLM stage)
- Confound regression (outputs confounds but does not regress them)
- Statistical analysis
Domain insight: fMRIPrep outputs the preprocessed BOLD data in both MNI space and native space. For MVPA, use the native-space output. For group analyses, use the MNI-space output.
Common Pitfalls
-
Smoothing before MVPA: Spatial smoothing destroys the fine-grained voxel patterns that multivariate methods rely on. Skip smoothing entirely for MVPA and searchlight analyses (Misaki et al., 2013)
-
Wrong interpolation for final reslicing: Use sinc or spline interpolation for the final reslice step. Trilinear interpolation is acceptable during motion parameter estimation but introduces blurring in final images (Poldrack et al., 2011, Ch. 5)
-
Not checking normalization quality: Normalization can fail silently, especially in populations with atypical anatomy (older adults, patients with lesions, pediatric). Always visually inspect the overlap of normalized functional images with the template
-
Motion-connectivity confound in resting-state: Head motion creates spurious short-distance correlations and reduces long-distance correlations in functional connectivity (Power et al., 2012). For resting-state analyses, use stringent motion thresholds (FD < 0.2 mm; Power et al., 2014) and aggressive confound regression (Ciric et al., 2017)
-
Skipping distortion correction: Without distortion correction, orbitofrontal and anterior temporal signals are mislocalized by several millimeters. This is especially problematic for studies of emotion, reward, and memory, which involve these regions (Jezzard & Balaban, 1995)
-
Applying band-pass filtering for task fMRI: Band-pass filtering (0.01-0.1 Hz) is appropriate for resting-state connectivity analysis but removes task-related signal in event-related and block designs. For task fMRI, use only high-pass filtering in the GLM
-
Not discarding non-steady-state volumes: The first few volumes have inflated signal intensity. If not removed, they can bias motion estimates and inflate variance (Poldrack et al., 2011, Ch. 5)
For detailed step-by-step parameters and software-specific guidance, see references/step-by-step-pipeline.md.
For quality control metrics, thresholds, and exclusion criteria, see references/quality-control.md.
Key References
- Andersson, J. L. R., Skare, S., & Ashburner, J. (2003). How to correct susceptibility distortions in spin-echo echo-planar images: application to diffusion tensor imaging. NeuroImage, 20(2), 870-888.
- Ashburner, J., & Friston, K. J. (2005). Unified segmentation. NeuroImage, 26(3), 839-851.
- Avants, B. B., Epstein, C. L., Grossman, M., & Gee, J. C. (2008). Symmetric diffeomorphic image registration with cross-correlation. Medical Image Analysis, 12(1), 26-41.
- Behzadi, Y., Restom, K., Liau, J., & Liu, T. T. (2007). A component based noise correction method (CompCor) for BOLD and perfusion based fMRI. NeuroImage, 37(1), 90-101.
- Ciric, R., Wolf, D. H., Power, J. D., et al. (2017). Benchmarking of participant-level confound regression strategies. NeuroImage, 154, 174-187.
- Esteban, O., Birman, D., Schaer, M., et al. (2017). MRIQC: Advancing the automatic prediction of image quality in MRI from unseen sites. PLoS ONE, 12(9), e0184661.
- Esteban, O., Markiewicz, C. J., Blair, R. W., et al. (2019). fMRIPrep: a robust preprocessing pipeline for functional MRI. Nature Methods, 16, 111-116.
- Fonov, V. S., Evans, A. C., Botteron, K., et al. (2011). Unbiased average age-appropriate atlases for pediatric studies. NeuroImage, 54(1), 313-327.
- Friston, K. J., Williams, S., Howard, R., et al. (1996). Movement-related effects in fMRI time-series. Magnetic Resonance in Medicine, 35(3), 346-355.
- Gorgolewski, K. J., Auer, T., Calhoun, V. D., et al. (2016). The brain imaging data structure. Scientific Data, 3, 160044.
- Greve, D. N., & Fischl, B. (2009). Accurate and robust brain image alignment using boundary-based registration. NeuroImage, 48(1), 63-72.
- Jenkinson, M., Bannister, P., Brady, J. M., & Smith, S. M. (2002). Improved optimization for the robust and accurate linear registration and motion correction of brain images. NeuroImage, 17(2), 825-841.
- Jezzard, P., & Balaban, R. S. (1995). Correction for geometric distortion in echo planar images from B0 field variations. Magnetic Resonance in Medicine, 34(1), 65-73.
- Mikl, M., Marecek, R., Hlustik, P., et al. (2008). Effects of spatial smoothing on fMRI group inferences. Magnetic Resonance Imaging, 26(4), 490-503.
- Misaki, M., Luh, W. M., & Bandettini, P. A. (2013). The effect of spatial smoothing on fMRI decoding of columnar-level organization. NeuroImage, 78, 13-22.
- Murphy, K., & Fox, M. D. (2017). Towards a consensus regarding global signal regression for resting state functional connectivity MRI. NeuroImage, 154, 169-173.
- Parker, D. B., & Razlighi, Q. R. (2019). The benefit of slice timing correction in common fMRI preprocessing pipelines. Frontiers in Neuroscience, 13, 821.
- Poldrack, R. A., Mumford, J. A., & Nichols, T. E. (2011). Handbook of Functional MRI Data Analysis. Cambridge University Press.
- Power, J. D., Barnes, K. A., Snyder, A. Z., et al. (2012). Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. NeuroImage, 59(3), 2142-2154.
- Power, J. D., Mitra, A., Laumann, T. O., et al. (2014). Methods to detect, characterize, and remove motion artifact in resting state fMRI. NeuroImage, 84, 320-341.
- Sladky, R., Friston, K., Trostl, J., et al. (2011). Slice-timing effects and their correction in functional MRI. NeuroImage, 58(2), 588-594.
See references/ for detailed pipeline parameters and quality control procedures.