SPRINT: A new parallel framework for R |
| |
Authors: | Jon Hill Matthew Hambley Thorsten Forster Muriel Mewissen Terence M Sloan Florian Scharinger Arthur Trew Peter Ghazal |
| |
Affiliation: | (1) EPCC, The University of Edinburgh, James Clerk Maxwell Building, Mayfield Road, Edinburgh, EH9 3JZ, UK;(2) Division of Pathway Medicine (DPM), The University of Edinburgh Medical School, Chancellor's building, 49 Little France Crescent, Edinburgh, EH16 4SB, UK |
| |
Abstract: | Background Microarray analysis allows the simultaneous measurement of thousands to millions of genes or sequences across tens to thousands
of different samples. The analysis of the resulting data tests the limits of existing bioinformatics computing infrastructure.
A solution to this issue is to use High Performance Computing (HPC) systems, which contain many processors and more memory
than desktop computer systems. Many biostatisticians use R to process the data gleaned from microarray analysis and there
is even a dedicated group of packages, Bioconductor, for this purpose. However, to exploit HPC systems, R must be able to
utilise the multiple processors available on these systems. There are existing modules that enable R to use multiple processors,
but these are either difficult to use for the HPC novice or cannot be used to solve certain classes of problems. A method
of exploiting HPC systems, using R, but without recourse to mastering parallel programming paradigms is therefore necessary
to analyse genomic data to its fullest. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|