extraction-form

Extraction Form (systematic review)

Goal: create a consistent, analysis-ready extraction table that is directly grounded in the protocol.

Inputs

Required:

Optional:

Outputs

Workflow

Determine the included set

Build/confirm the schema

Use the extraction schema defined in output/PROTOCOL.md.
If the protocol does not define fields yet, stop and update output/PROTOCOL.md first.

Populate papers/extraction_table.csv

One row per included paper.
If papers/paper_notes.jsonl exists, use it as a structured source for values and provenance, but keep the table schema governed by output/PROTOCOL.md.
Always include provenance columns:
paper_id, title, year, url
For each protocol-defined field:
fill concrete values (units explicit)
use an explicit sentinel for unknowns (recommended: leave the cell empty and record the reason in a notes column)
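The population rules above can be sketched in Python as follows. This is a minimal illustration, not the project's actual tooling: the function names build_rows and write_table, the field names sample_size and accuracy_pct, and the sample papers are all made up for the example. In practice the included papers would come from papers/screening_log.csv and the extracted values from papers/paper_notes.jsonl, and the field list from output/PROTOCOL.md; their exact layouts are not specified here, so the sketch works on in-memory data.

```python
import csv
import io

# Provenance columns that every row must carry.
PROVENANCE = ["paper_id", "title", "year", "url"]


def build_rows(included, protocol_fields, notes_by_id):
    """Return one row per included paper.

    Unknown values become empty cells, and the affected field names
    are recorded in the notes column.
    """
    rows = []
    for paper in included:
        extracted = notes_by_id.get(paper["paper_id"], {})
        row = {col: paper.get(col, "") for col in PROVENANCE}
        missing = []
        for field in protocol_fields:
            value = extracted.get(field)
            if value is None or value == "":
                row[field] = ""
                missing.append(field)
            else:
                row[field] = value
        row["notes"] = "not reported: " + ", ".join(missing) if missing else ""
        rows.append(row)
    return rows


def write_table(rows, protocol_fields, handle):
    """Write rows as CSV with provenance first, then protocol fields, then notes."""
    fieldnames = PROVENANCE + protocol_fields + ["notes"]
    writer = csv.DictWriter(handle, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)


# Hypothetical inputs, for illustration only.
fields = ["sample_size", "accuracy_pct"]  # assumed protocol fields, unit in the name
included = [
    {"paper_id": "P001", "title": "Example study A", "year": "2021",
     "url": "https://example.org/a"},
    {"paper_id": "P002", "title": "Example study B", "year": "2022",
     "url": "https://example.org/b"},
]
notes = {
    "P001": {"sample_size": "120", "accuracy_pct": "91.5"},
    "P002": {"sample_size": "48"},  # accuracy not reported
}

rows = build_rows(included, fields, notes)
buffer = io.StringIO()  # stands in for papers/extraction_table.csv
write_table(rows, fields, buffer)
```

Putting the unit in the column name (accuracy_pct) is one way to keep units explicit without adding a separate unit column; the protocol may prescribe a different convention.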

Keep it auditable

Quick QA

Definition of Done

papers/extraction_table.csv exists.
Every included paper from papers/screening_log.csv has exactly one extraction row.
Column meanings match output/PROTOCOL.md (no ad-hoc columns without updating the protocol).
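The last two checks can be automated with a short script. The sketch below is one possible approach under stated assumptions: check_coverage and check_columns are hypothetical helper names, and the caller is assumed to have already read the included paper IDs from papers/screening_log.csv and the field list from output/PROTOCOL.md (neither file's layout is specified here).

```python
from collections import Counter


def check_coverage(included_ids, extraction_ids):
    """Compare included paper IDs against the paper_id column of the table.

    Returns IDs that are missing from the table, IDs that appear in more
    than one row, and IDs in the table that were never included.
    """
    counts = Counter(extraction_ids)
    included = set(included_ids)
    return {
        "missing": sorted(included - set(counts)),
        "duplicated": sorted(pid for pid, n in counts.items() if n > 1),
        "unexpected": sorted(set(counts) - included),
    }


def check_columns(table_columns, protocol_fields,
                  provenance=("paper_id", "title", "year", "url"),
                  extra=("notes",)):
    """Return table columns that are not provenance, protocol-defined, or notes."""
    allowed = set(provenance) | set(protocol_fields) | set(extra)
    return sorted(set(table_columns) - allowed)
```

The table meets the definition when all three lists from check_coverage are empty and check_columns returns an empty list. Any column it reports is an ad-hoc column that should either be removed or added to the protocol first.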

Troubleshooting

Issue: the protocol does not specify extraction fields

Fix:

Issue: extraction table mixes narrative text with fields

Fix:

Move narrative into a notes column and keep the rest as atomic values (numbers/enums/short strings).
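To locate cells that likely hold narrative text, a rough heuristic can flag long values or values that contain a sentence break. The sketch below is illustrative only: find_narrative_cells is a hypothetical helper, and the 40-character threshold and the skipped columns are assumptions that should be tuned to the protocol's fields.

```python
def find_narrative_cells(rows, max_chars=40, skip=("notes", "title", "url")):
    """Return (row_index, column) pairs whose values look like prose.

    A cell is flagged when it is longer than max_chars or contains a
    sentence break (a period followed by a space). Columns listed in
    skip are allowed to hold free text.
    """
    flagged = []
    for index, row in enumerate(rows):
        for column, value in row.items():
            if column in skip:
                continue
            text = (value or "").strip()
            if len(text) > max_chars or ". " in text:
                flagged.append((index, column))
    return flagged
```

Each flagged cell is a candidate for manual review: move the prose into notes and leave a number, enum value, or short string behind.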

Source Transparency