Hippocampus and amygdala volumes from magnetic resonance images in children: Assessing accuracy of FreeSurfer and FSL against manual segmentation

Dorothee Schoemaker; Claudia Buss; Kevin Head; Curt A Sandman; Elysia P Davis; M Mallar Chakravarty; Serge Gauthier; Jens C Pruessner

doi:10.1016/j.neuroimage.2016.01.038

Hippocampus and amygdala volumes from magnetic resonance images in children: Assessing accuracy of FreeSurfer and FSL against manual segmentation

Neuroimage. 2016 Apr 1:129:1-14. doi: 10.1016/j.neuroimage.2016.01.038. Epub 2016 Jan 26.

Authors

Dorothee Schoemaker¹, Claudia Buss², Kevin Head³, Curt A Sandman³, Elysia P Davis⁴, M Mallar Chakravarty⁵, Serge Gauthier⁶, Jens C Pruessner¹

Affiliations

¹ McGill Centre for Studies in Aging, McGill University, Montreal, QC, Canada; Douglas Hospital Research Centre, Psychiatry Department, McGill University, Montreal, QC, Canada.
² University of California at Irvine, CA, USA; Charité, Berlin, Germany.
³ University of California at Irvine, CA, USA.
⁴ University of California at Irvine, CA, USA; University of Denver, CO, USA.
⁵ Douglas Hospital Research Centre, Psychiatry Department, McGill University, Montreal, QC, Canada; Biomedical Engineering Department, McGill University, Montreal, QC, Canada.
⁶ McGill Centre for Studies in Aging, McGill University, Montreal, QC, Canada.

Abstract

The volumetric quantification of brain structures is of great interest in pediatric populations because it allows the investigation of different factors influencing neurodevelopment. FreeSurfer and FSL both provide frequently used packages for automatic segmentation of brain structures. In this study, we examined the accuracy and consistency of those two automated protocols relative to manual segmentation, commonly considered as the "gold standard" technique, for estimating hippocampus and amygdala volumes in a sample of preadolescent children aged between 6 to 11 years. The volumes obtained with FreeSurfer and FSL-FIRST were evaluated and compared with manual segmentations with respect to volume difference, spatial agreement and between- and within-method correlations. Results highlighted a tendency for both automated techniques to overestimate hippocampus and amygdala volumes, in comparison to manual segmentation. This was more pronounced when using FreeSurfer than FSL-FIRST and, for both techniques, the overestimation was more marked for the amygdala than the hippocampus. Pearson correlations support moderate associations between manual tracing and FreeSurfer for hippocampus (right r=0.69, p<0.001; left r=0.77, p<0.001) and amygdala (right r=0.61, p<0.001; left r=0.67, p<0.001) volumes. Correlation coefficients between manual segmentation and FSL-FIRST were statistically significant (right hippocampus r=0.59, p<0.001; left hippocampus r=0.51, p<0.001; right amygdala r=0.35, p<0.001; left amygdala r=0.31, p<0.001) but were significantly weaker, for all investigated structures. When computing intraclass correlation coefficients between manual tracing and automatic segmentation, all comparisons, except for left hippocampus volume estimated with FreeSurfer, failed to reach 0.70. When looking at each method separately, correlations between left and right hemispheric volumes showed strong associations between bilateral hippocampus and bilateral amygdala volumes when assessed using manual segmentation or FreeSurfer. These correlations were significantly weaker when volumes were assessed with FSL-FIRST. Finally, Bland-Altman plots suggest that the difference between manual and automatic segmentation might be influenced by the volume of the structure, because smaller volumes were associated with larger volume differences between techniques. These results demonstrate that, at least in a pediatric population, the agreement between amygdala and hippocampus volumes obtained with automated FSL-FIRST and FreeSurfer protocols and those obtained with manual segmentation is not strong. Visual inspection by an informed individual and, if necessary, manual correction of automated segmentation outputs are important to ensure validity of volumetric results and interpretation of related findings.

Keywords: Amygdala; FSL-FIRST; FreeSurfer; Hippocampus; Pediatric population; Segmentation techniques.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Amygdala / anatomy & histology*
Child
Female
Hippocampus / anatomy & histology*
Humans
Image Processing, Computer-Assisted / methods*
Magnetic Resonance Imaging / methods
Male
Neuroimaging / methods*

Abstract

Publication types

MeSH terms

Grants and funding