Towards a scientific workflow methodology for primary care database studies.
Curcin, Vasa;
Bottle, Alex;
Molokhia, Mariam;
Millett, Christopher;
Majeed, Azeem;
(2010)
Towards a scientific workflow methodology for primary care database studies.
Statistical Methods in Medical Research, 19 (4).
pp. 378-393.
ISSN 0962-2802
DOI: https://doi.org/10.1177/0962280209359880
We describe the challenges of conducting studies based on mining large-scale primary care databases, namely data integration, data set definition, result reproducibility and reusability. These correspond to higher-level informatics challenges of automation, provenance capture and component integration. We provide a high-level view of the informatics infrastructure that addresses these challenges through a generic workflow-based e-Science middleware, and describe our experiences using the system to investigate differences in the health status of patients with diabetes before and after the national introduction of the UK GP contract in 2004.