Skip to main content
  • Original research
  • Open access
  • Published:

Comparison of heterogeneity quantification algorithms for brain SPECT perfusion images



Several algorithms from the literature were compared with the original random walk (RW) algorithm for brain perfusion heterogeneity quantification purposes. Algorithms are compared on a set of 210 brain single photon emission computed tomography (SPECT) simulations and 40 patient exams.


Five algorithms were tested on numerical phantoms. The numerical anthropomorphic Zubal head phantom was used to generate 42 (6 × 7) different brain SPECT simulations. Seven diffuse cortical heterogeneity levels were simulated with an adjustable Gaussian noise function and six focal perfusion defect levels with temporoparietal (TP) defects. The phantoms were successively projected and smoothed with Gaussian kernel with full width at half maximum (FWHM = 5 mm), and Poisson noise was added to the 64 projections. For each simulation, 5 Poisson noise realizations were performed yielding a total of 210 datasets. The SPECT images were reconstructed using filtered black projection (Hamming filter: α = 0.5).

The five algorithms or measures tested were the following: the coefficient of variation, the entropy and local entropy, fractal dimension (FD) (box counting and Fourier power spectrum methods), the gray-level co-occurrence matrix (GLCM), and the new RW.

The heterogeneity discrimination power was obtained with a linear regression for each algorithm. This regression line is a mean function of the measure of heterogeneity compared to the different diffuse heterogeneity and focal defect levels generated in the phantoms. A greater slope denotes a larger separation between the levels of diffuse heterogeneity.

The five algorithms were computed using 40 99mTc-ethyl-cysteinate-dimer (ECD) SPECT images of patients referred for memory impairment. Scans were blindly ranked by two physicians according to the level of heterogeneity, and a consensus was obtained. The rankings obtained by the algorithms were compared with the physicians' consensus ranking.


The GLCM method (slope = 58.5), the fractal dimension (35.9), and the RW method (31.6) can differentiate the different levels of diffuse heterogeneity. The GLCM contrast parameter method is not influenced by a focal defect contrary to the FD and RW methods. A significant correlation was found between the RW method and the physicians' classification (r = 0.86; F = 137; p < 0.0001).


The GLCM method can quantify the different levels of diffuse heterogeneity in brain-simulated SPECT images without an influence from the focal cortical defects. However, GLCM classification was not correlated with the physicians' classification (Rho = − 0.099). The RW method was significantly correlated with the physicians' heterogeneity perception but is influenced by the existence of a focal defect.


Measuring the heterogeneity of medical images has been described for many medical conditions, including in bone marrow to quantify osteoporosis [1, 2], for tumor characterization [35], myocardial metabolism with positron emission tomography (PET) [610], in subjects with Alzheimer’s disease (AD) with PET [11], perfusion imaging in brain single photon emission computed tomography (SPECT) [1214], and perfusion in brain SPECT images in patients suffering from AD [15] or from drug-induced therapy [16].

Heterogeneity is a visual perception that varies by person. In clinical practice, physicians subjectively analyze the heterogeneity of medical images because of the lack of an objective and reliable method to quantify this parameter. In functional imaging, such as nuclear medicine, the analysis of image heterogeneity is complex because of a low signal-to-noise ratio and low spatial resolution. Artifacts from the reconstruction process are another type of noise superimposed on images dedicated to the exam's interpretation. Because of this subjective aspect, quantification and analysis of the heterogeneity of medical images are complex and less reproducible. An ideal mathematical method would be reproducible, objective and would provide results that are correlated with the physician's results. Some methods have been described in the literature to solve the problem of a varying analysis of heterogeneity in medical images. These methods are mostly based on a fractal analysis [15, 17], coefficient of variation calculation [11], and texture analysis [1823]. Recently, descriptors based on the random walk (RW) theory have emerged [2428]. Each of these methods is derived from different mathematical theories and yields a different view/representation/perception of an image.

Functional imaging of the brain with 99mTc-hexamethyl-propylene-amine-oxime (HMPAO) or 99mTc-ethyl-cysteinate-dimer (ECD) SPECT assesses cerebral blood flow (CBF). A CBF measurement with SPECT is valuable for most cerebral diseases, particularly in patients with AD [29, 30] or epilepsy [31, 32]. In these pathologies, SPECT abnormalities are focal, and different methods have been developed to quantify the intensity of focal CBF abnormalities.

Diffuse brain SPECT abnormalities have been described in most systemic diseases, such as systemic lupus erythematosus [33, 34]; hypothyroidism [35]; after chemotherapy [16, 36], carbon monoxide poisoning [37] or manganese toxicity [38]; and in cocaine [39] or alcohol abuse [17]. Because objective quantification is lacking, brain SPECT images were analyzed visually in these previous studies.

The present paper compares our method, which was based on the RW method, with the five methods commonly used in the literature to evaluate the heterogeneity of SPECT images. The comparison was first conducted with simulated brain perfusion SPECT images to assess how the images are influenced by focal cortical defects (such as in AD) and the global diffuse heterogeneity. The goal was to find the best algorithm to discriminate the more diffuse perfusion heterogeneity levels without an influence from focal defects. The second part of the manuscript will compare the classification of these algorithms using real brain HMPAO SPECT perfusion images. The population was composed of 15 normal subjects and 25 patients referred for memory impairment. The goal of this portion of the study was to find the best algorithm that correlated with the physicians' consensus.


To evaluate the RW algorithm, a comparison of five traditional algorithms was performed on phantom and real brain perfusion images. Algorithms were computed only on the cortical rim, obtained with a masking process (described in § 2.2.2) A brief description of each method is presented below:

  1. 1

    The coefficient of variation (CV) is defined as the ratio of the standard deviation and the mean of the intensity levels. In our case, the CV was measured in the whole brain slice (CV3D)

  2. 2.

    The entropy, H, is expressed as:

    H = i = 0 G 1 p ( i ) log 2 [ p ( i ) ]

where p(i) is the density probability of the gray-level occurrence i, and G is the number of gray levels in the image. Entropy was first calculated as a global 3D parameter (entropy 3D). Entropy was also computed as a 2D parameter as follows: each pixel was replaced by the measure of the entropy of its neighborhood (3 × 3 and 5 × 5); the sum of all local entropies in all brain slices represents the final result (entropy 3 × 3 and entropy 5 × 5, respectively).

  1. 3.

    The fractal dimension (FD) followed two algorithms: (1) the box counting (BC) method [40] and (2) the Fourier power spectrum (FPS) method [41].

The 2D FD BC method analyzes the image by the complexity changes in its different gray-level and uses a set of threshold values T (0.3 < T < 0.7) to achieve a set of binary versions of the original image. The fractal dimension is estimated by the slope of the regression line between the size of the boxes (1, 2, 4, 8, and 16 pixels) and the number of the boxes needed to encompass the binary object. In this case, the minimum, maximum, median, mean, and standard deviation were studied along slices.

The Fourier power spectrum method or FD FPS calculated in 2D (mean of the FD along the slices) is based on the power spectrum dependence of the fractional Brownian motion. In the 2D power spectrum method, each line height profile that forms the image is Fourier transformed; the power spectrum is evaluated and all of these power spectra are averaged. The fractal dimension is evaluated from the slope, β, of a least-squares regression line fit to the data points in a log-log plot of the power spectrum, as FD = 7/2 − β/2. The FPS was also calculated in 3D (FD FPS 3D). Reader is referred to [40] for a detailed description of the FD FPS in 2D and its generalization in higher dimensions. Algorithms are derived from the original implementation in [42].

  1. 4.

    In the gray-level co-occurrence matrices method (GLCM) [43, 44], each matrix represents the joint probability distributions between the gray-level values of pairs of pixels at a predetermined distance (1 pixel) and orientation (0°, 45°, 90°, and 135°). The GLCMs were averaged to obtain one GLCM for each 2D slice. Four statistical measures or parameters of the GLCM (i.e., contrast, homogeneity, energy, and correlation) were computed on 255 gray levels. A fifth parameter averaging three spatial distances (1, 2, and 3 pixels) was also created.

  2. 5.

    In the random walk method or RW [27, 28], a virtual walking particle performs a random walk on a 2-dimensional image lattice with a four-neighbor system. The purpose of the RW is to visit a large number of new pixels in homogenous regions and to be confined (i.e., to pass over the same pixels previously visited) within heterogeneous regions. Each step of the RW is decomposed into the following two procedures:

  3. (a)

    Moving procedure

When (i, j) is the current position of the particle, the decision to move or not with a probability (Pmove) is described by the following expression:

P move ( i , j ) = I ( i , j ) max ( I ( x , y ) ) S

where I(i,j) is the intensity of the pixel, (i,j), the denominator is the maximum intensity taken over all pixels (x,y) from the selected slices, and S is a slowing parameter (S = 3). The purpose of this procedure is to differentiate the behavior of the particle according to the local intensity. A number, q, is randomly chosen between [0, 1]. If q < P move, the movement is accepted, and the algorithm continues to the jump procedure. If q ≥ P move and the movement is not accepted at time t, the particle stays at its current position and a new attempt is made at time t + 1.

  1. (b)

    Jump procedure

This procedure is performed only if the movement has been previously accepted. If so, the algorithm decides which of its four neighbors the particle will jump to according to the following transition probabilities. The probability of jumping from a pixel (i,j) to one of its neighbors (k,l) is given by the following expression:

P jump ( i , j ) , ( k , l ) = exp ( I ( k , l ) / T ) ( m , n ) exp ( I ( m , n ) / T )

where the denominator is the sum taken over the four neighbors (m,n) of pixel (i,j), and T is the temperature parameter. The higher T results in an easier jump from high intensity (hot) regions to low intensity (cold) regions. We experimentally chose T = 6.

After 1,000 steps, the algorithm outputs the visiting rate denoted r, which is the number of visited pixels divided by 1,000. To provide a good estimation of the expected value of r, a large number (1,000) of independent random walks on the 2D-image are computed. This procedure is repeated on a selected number of slices in the cortex. At the end, the algorithm outputs R, the average of r-values over all the selected slices. R is the average number of visited pixels as a percentage (R value in percent). The reader is referred to [27] for a full description and validation of the RW algorithm parameter.

Phantom study

To study the robustness of the algorithms and their ability to discriminate between diffuse heterogeneity and a focal defect, a simulation experiment was performed using MATLAB, v 7.0 (Mathworks, Natick, MA, USA) on a standard personal computer platform.

Phantom design

Simulations of normal and pathological phantom are presented in Figure 1.

Figure 1
figure 1

Examples of nine samples from the 210 non-masked numerical Zubal phantoms are represented. The diffuse heterogeneity level and the focal defect level were normalized. Zero is a normal level, and one is the most abnormal level.

Normal phantom

The modified Zubal head phantom [45] was used to simulate normal HMPAO brain perfusion SPECT images. First, the 63 anatomical regions of the magnetic resonance imaging (MRI)-Zubal head phantom were transformed into 19 functional regions. The cortical, extra-cortical, and extra-cerebral blood flow values were obtained from a normal database of 12 controls (8 women and 4 men; mean age ± standard deviation (SD), 74. ± 3.9 years) followed in the prospective Eugeria study [46]. Morphologic imaging (magnetic resonance imaging and computed tomography) was normal in the 12 control subjects. The CBF values were obtained from Catafau et al. [47], and the methodology has been described previously [48]. The cerebellum CBF value was arbitrarily defined as 100. Effects of sex, interhemispheric asymmetry, and age were not taken into account with this model.

Pathological phantom

Diffuse brain heterogeneity and focal perfusion defects were successively simulated. Diffuse heterogeneity of brain perfusion was simulated by adding Gaussian noise only in the cortex region. This Gaussian noise has an adjustable variance and a zero mean. Six diffuse heterogeneity levels were constructed (σ = 0 for normal subjects, and successively σ = 100, 2,000, 4,000, 6,000, 8,000, 10,000 for the six levels of heterogeneity). The focal defect simulation was used to mimic Alzheimer’s disease, which is characterized by temporoparietal (TP) defects. Five different levels of TP perfusion were used (93 for normal subjects, and successively 80, 70, 60, 50, and 40 for the five pathological defects). The different levels of heterogeneity and focal defects were validated by physicians so that the simulations were as realistic as possible compared to clinical practice. A total of 42 phantoms were simulated: (1 normal + 6 heterogeneity levels) × (1 normal + 5 TP cortical defect levels). Heterogeneity values (0 to 10,000) and perfusion defect values (93 to 40) were considered by physicians to cover the range of CBF observed in clinical practice.

Simulation of the acquisition and reconstruction process

The 42 phantoms were successively projected in 64 projections of 128 × 128. The projections were smoothed with a constant Gaussian kernel rather than a distance dependent function to simulate the point spread function of our gamma-camera (DST-Xl, GEMS, Buc, France) equipped with low-energy ultra-high resolution collimators (LE-UHR, GEMS, Buc, France), corresponding to a full width at half maximum of 5.5 mm for the tomographic acquisition. Poisson noise was added to the projections to simulate the radioactivity emission of the tracer. For each phantom, the total number of counts was equal to 10 M. SPECT images were reconstructed using filtered back projection with a Hamming filter (α = 0.5), which led to 42 reconstructed SPECT exams corresponding to a set of seven levels of diffuse heterogeneity and six levels of focal defects. For the statistical analysis, the diffuse heterogeneity level and the focal defect level were normalized to zero, a normal level, and one, which was the most abnormal level (i.e., 40 for the ‘highest’ focal defect level and 10,000 for the highest heterogeneity level). This scaling is integrated in Figure 2. The Poisson noise was repeated five times, mimicking five different acquisitions of each of the 42 simulated phantoms, which resulted in a total of 210 different simulations. All the 210 simulations were used in the statistical analysis and computed with all methods.

Figure 2
figure 2

Diffuse heterogeneity and focal defect discrimination power are plotted for each algorithm tested (box and whiskers (mean, median, and min-max values)).

To study only cortical activity, all phantoms were masked. The outward and inward boundaries of the cortical rim were determined by using a threshold of 80% and 55%, as previously described in Volkow et al. [11]. The right side of the mask was generated symmetrically from the left side.

After reconstruction, the heterogeneity quantification parameters, as described above, were computed for each of the 210 phantoms. The results of each algorithm were normalized for a statistical comparison of the discrimination power (a maximal result was a value of 100).

Comparison statistics design: discrimination powers

For each algorithm, a diffuse heterogeneity discrimination power (HDP) was obtained as follows: for each level of the six TP perfusion defect, a regression line was calculated between the heterogeneity level and the parameter of the algorithm studied (Figure 3A). The slope of each regression line was calculated. The mean and standard deviation of the slopes were plotted for comparison. The mean slope of the six slopes obtained from a linear regression was considered to be the discrimination power of the algorithm studied for the diffuse heterogeneity quantification (Figure 3C). A mean slope of 1 denotes a maximal separation between the levels of diffuse heterogeneity. A smaller standard deviation (i.e., identical slopes for one method) shows that the heterogeneity quantification is more independent of the TP perfusion defect levels.

Figure 3
figure 3

Diffuse heterogeneity discrimination power and perfusion defect levels. (A) For each heterogeneity level, the GLCM contrast parameter results are plotted as a function of the perfusion defect levels, leading to seven regression lines (0.7 < R2 <0.9). (B) For each perfusion defect, the GLCM contrast parameter results are plotted as a function of the heterogeneity levels, leading to six regression lines (0.7 < R2 < 0.9). (C) Box and whiskers (min - max) representation is shown. The mean slope (absolute value) is 1.22, which corresponds to a poor perfusion defect discrimination power (DDP). (D) The mean slope (absolute value) is 58.48, which corresponds to a relatively good HDP.

A focal defect discrimination power (DDP) is obtained in a same way than the diffuse heterogeneity discrimination power for every algorithm. For each of the seven heterogeneity levels, a regression line was calculated between the perfusion defect level and the parameter of the algorithm studied (Figure 3B). The slope of each regression line was calculated. The mean and standard deviation of the slopes were plotted to compare the methods. The mean of the seven slopes obtained by a linear regression was considered to be the discrimination power of the algorithm studied for the TP perfusion defect level (Figure 3D). A mean slope of 1 denotes a maximal separation between the levels of TP. A smaller standard deviation (i.e., identical slopes for one method) shows that the heterogeneity quantification is more independent of the TP perfusion defect levels.

Mean slopes and range of slopes (i.e., standard deviation) are used as box and whiskers to compare all algorithms (Figure 2). The best heterogeneity quantification algorithm is the one with high HDP (mean slope) and low DDP. Moreover, we are searching the algorithm with HDP as less as possible influenced/correlated with the defects levels (i.e., low standard deviation).

Subject study

SPECT population, acquisition, reconstruction, and display

The heterogeneity quantification parameters were applied to 25 ECD SPECT images from subjects with suspected focal defects, including Alzheimer’s disease, frontotemporal dementia, and primary progressive aphasia (8 women and 17 men; mean age ± SD, 63.6 ± 7 years), and from 15 normal subjects (6 dementia women and 9 men; mean age ± SD, 51.1 ± 13.7 years) acquired in a normal prospective study (CIC-INSERM 02-112-HP). All normal subjects had given their written informed consent to participate in the study. The ethical board of the Medical Faculty of Rouen approved the SPECT procedure. For these normal subjects, the clinical, neurological, and neuropsychological examinations (i.e., the Mini Mental Score Examination, the Mattis Dementia Rating Scale, the Grober-Buschke verbal memory test, the Montgomery-Asberg Depression Rating Scale and Goldberg’s depression test) remained normal. The MRI imaging was normal in these 15 control subjects.

SPECT acquisitions were performed in all 40 subjects with a dual-headed gamma camera (DST-XL, GEMS, Buc, France) equipped with LE-UHR collimators. Acquisitions were performed 1 h after an intravenous administration of 1,000 MBq of 99mTc-ECD under low light and sound conditions. The subject's head was safely positioned in an adjustable head holder. For each SPECT acquisition, 64 angular views of 60 s each were obtained through a 360° circular orbit (32 angular views per head). The data were recorded in a 128 × 128 matrix. The SPECT images were reconstructed from the projection data using the filtered back-projection algorithm with a Hann filter (α = 0.5) and a software zoom of 2 (matrix 128 × 128 × 128, voxel size = 1.7 × 1.7 × 1.7 mm). No attenuation correction was performed. The acquisition and reconstruction parameters used are best-adapted for standard brain SPECT studies in clinical practice [49]. The French Sopha Rainbow look-up table (LUT) (Vision Workstation GEMS, Buc, France) was used for the display of brain SPECT exams. The display was done without high or low thresholding. The cerebellum was considered as the region of reference for the LUT normalization. Brain perfusion exams were presented to physicians in 16 slices (4 × 4) set from the top of the cortex to the bottom of cerebellum. Reconstructed images were normalized in the Talairach space using SPM2 [50].

Physicians’ SPECT image interpretation and random walk calculation

The 40 reconstructed images were blindly analyzed and ranked by two physicians according to the degree of diffuse heterogeneity but not according to their focal defect level. The correlation of the ranking of the two observers was calculated. The final ranking was obtained after consensus of the two observers. For each of the five algorithm families, a diffuse heterogeneity parameter was calculated for all SPECT data for 40 slices ranging from AC-PC-10 to AC-PC + 30 after applying both a spatial normalization and a cortical mask.


The rank of each subject, as defined by the physicians’ consensus, was correlated with the rank provided by each of the five algorithm families with a Spearman coefficient (ρ). A statistical significance level of 0.0001 was chosen after Bonferroni correction.

Results and discussion


Phantoms study

The results of the HDP and DDP for each algorithm are plotted in Figure 2.

The three algorithms that discriminated higher heterogeneity levels (i.e., a high HDP) and were least influenced by focal defects (i.e., a low DDP) were the following: the contrast of GLCM parameter had an average HDP of 58.5 (range from 51.6 to 63) and a DDP of 1.2 (0.2 to 2.9); the standard deviation parameter of fractal dimension box counting (FDBC) had an HDP of 35.9 (31.1 to 41.2) and a DDP of 3.8 (0.4 to 7.4); and the RW algorithm had an HDP of 31.6 (22.2 to 44.3) and a DDP of 20.6 (12.3 to 32.6).

Patient study

The correlation between the two physicians’ ranking was ρ = 0.86. The correlation of ranking between every algorithm and the physicians’ consensus is presented in Table 1. The RW algorithm shows a significant correlation with the physicians’ consensus (ρ = 0.85). The standard deviation parameter of the FDBC, like the GLCM contrast parameter ranking, was not significantly correlated with the physicians’ consensus (ρ = −0.46 and ρ = −0.1, respectively).

Table 1 The correlation (Spearmann coefficient) between the algorithms and the physicians’ consensus

The ranking of the 3D and local entropy is significantly correlated (ρ = 0.86). Some parameters of GLCM are correlated to each other (contrast and contrast 3 N, for example; ρ = 0.87).


Several algorithms from the literature were compared with the original RW algorithm for heterogeneity quantification purposes. These algorithms represented a range of approaches for the possible quantification of diffuse brain SPECT perfusion heterogeneity.

The first comparison was achieved on 210 simulated brain SPECT perfusion exams based on the Zubal head phantom. The major simulation characteristic was the seven different heterogeneity levels mixed with six different regional defects levels. There is no gold standard in defining a brain perfusion heterogeneity pattern. Because this definition is lacking, perfusion heterogeneity was simulated with a Gaussian noise of zero mean distributed on the gray matter. The extreme level of heterogeneity (σ = 10,000) was defined by the physicians. The five levels of heterogeneity were gradually obtained between the normal and extreme pathological level. The GLCM contrast parameter was the more robust algorithm for quantifying the diffuse heterogeneity levels without an influence from the focal defects. The RW algorithm provided relatively good results, but it was influenced by two simulation characteristics, heterogeneity, and focal defects. The influence of the size and intensity of the defect and acquisition quality (count level) have been previously tested [28]. Briefly, the minimal count level acquired must be five million for constant RW measurements. The increase of the size and intensity of the defect result in the increase of the RW results. The increase of heterogeneity (simulated noise) induces an increased in the RW results. Most methods (other than RW and GLCM) are sensitive to focal defects since these are based on global rather than local measurements.

The second comparison was achieved by ranking a set of 40 brain SPECT perfusion exams by their diffuse heterogeneity. We have compared the ranking from the different algorithms and the ranking achieved from the physicians’ consensus. We have shown that the RW algorithm is the only one correlated to the physicians’ consensus. In this case, the GLCM contrast parameter was not correlated with the physicians’ consensus.

For the phantom study, we opted for an analytic simulation method. Our modeling takes into account both Poisson noise in projections and loss of spatial resolution (constant Gaussian rather than a distance dependent function). This kind of analytic simulation method was previously described in Aubert-Broche et al. [51]. Other authors have used Monte Carlo simulations to create brain perfusion SPECT images [52]. Monte Carlo simulations are more realistic because they model stochastic aspects related to photon emission, propagation, and interaction. Nevertheless, we believe that this improvement in the realism of the simulation process should not change the performance of the method tested.

No attenuation correction has been done on patients and phantom studies. The brain SPECT images are not corrected from attenuation in our clinical practice. Therefore, we decided not to test this parameter in this work. The attenuation correction of the brain SPECT images could modify the results of all methods. Some of them may have different sensitivity regarding the attenuation correction.

The results of the two comparison studies seem contradictory. The GLCM contrast parameter provided good results for the simulated images but was not correlated to the physicians’ ranking of the real brain SPECT perfusion images. In fact, the contradiction could be ignored because two different characteristics were tested. In the algorithm comparison for the phantom images, we looked at which algorithm discriminated the most different levels of heterogeneity. We also looked at whether this algorithm is influenced by the perfusion defects. Furthermore, these simulated pathologies were perfectly quantified. In the comparison of the algorithms for the patient population, we looked for an algorithm that would rank the population in the same order as physicians for a population likely to present with focal defects but little diffuse heterogeneity.

The RW method and the physicians’ ranking were significantly correlated (ρ = 0.85). It was asked to the physicians to rank the simulated phantoms only regarding the heterogeneity, excluding the intensity of the perfusion defect. The RW method was influenced by the perfusion defects in the diffuse heterogeneity classification in an almost equivalent manner to the physicians’ ranking. However, we demonstrated that RW was influenced by heterogeneity, but also with perfusion defect (Figure 2). A detailed design, behavior, and properties (influence of the two pathologies) of the RW algorithm are provided in an earlier published article [27]. The two physicians were experts in neuro-scintigraphy with almost 10 years of experience each and did not know the different levels of pathology on the phantom set. The physicians classified the patient as normal or abnormal and not (sometimes) clearly as heterogeneous or perfusion defects. When physicians ranked simulated phantoms only by heterogeneity level, they were probably influenced by the abnormality of images caused by perfusion defect. Characterizing the two pathologies at the same time and distinguishing the level was probably a difficult task. The correlation coefficient between the two physicians’ ranking (ρ = 0.86) showed that the consensus was necessary for a final decision, and a tool to reduce inter- and intra-observer bias for diffuse heterogeneity quantification is needed. This point demonstrates a limitation in the RW method. This limitation of the RW algorithm looks similar to the limitation found in clinical interpretation, but is an advantage if the aim is to obtain a similar ranking with the physicians.

The GLCM contrast parameter was not influenced by the focal defects and discriminated diffuse heterogeneity well. Therefore, it was expected that the classification of the population by this algorithm was far from the medical classification.


In conclusion, the patient study demonstrated the following: (1) The definition of a normal SPECT exam is complex because normal subjects can be classified as clinically heterogeneous apart from all other clinical data. (2) It is a difficult task to rank patients with extended focal hypoperfusion as patients not having a diffuse heterogeneity.

The phantom and patient study results demonstrated the following: (1) The RW method was correlated with the medical classification. However, an algorithm, such as GLCM contrast parameter, provided better results than the RW method for the diffuse heterogeneity classification when heterogeneity was strictly defined as a phenomenon that resembles noise (coming from the acquisition or reconstruction process). (2) Both RW and physician’s results appear to reflect both heterogeneity and presence of lesions rather than heterogeneity alone.

These results must be confirmed by future studies of patients with diseases that have only diffuse heterogeneity and no focal defects.


  1. Vokes TJ, Giger ML, Chinander MR, Karrison TG, Favus MJ, Dixon LB: Radiographic texture analysis of densitometer-generated calcaneus images differentiates postmenopausal women with and without fractures. Osteoporos Int 2006,17(10):1472–82. 10.1007/s00198-006-0089-y

    Article  CAS  PubMed  Google Scholar 

  2. Zaia A, Eleonori R, Maponi P, Rossi R, Murri R: MR imaging and osteoporosis: fractal lacunarity analysis of trabecular bone. IEEE Trans Inf Technol Biomed 2006,10(3):484–9. 10.1109/TITB.2006.872078

    Article  PubMed  Google Scholar 

  3. O'Sullivan F, Roy S, O'Sullivan J, Vernon C, Eary J: Incorporation of tumor shape into an assessment of spatial heterogeneity for human sarcomas imaged with FDG-PET. Biostatistics 2005,6(2):293–301. 10.1093/biostatistics/kxi010

    Article  PubMed  Google Scholar 

  4. Tixier F, Le Rest CC, Hatt M, Albarghach N, Pradier O, Metges JP, Corcos L, Visvikis D: Intratumor heterogeneity characterized by textural features on baseline 18 F-FDG PET images predicts response to concomitant radiochemotherapy in esophageal cancer. J Nucl Med 2011,52(3):369–78. 10.2967/jnumed.110.082404

    Article  PubMed Central  PubMed  Google Scholar 

  5. Wyss MT, Hofer S, Hefti M, Bärtschi E, Uhlmann C, Treyer V, Roelcke U: Spatial heterogeneity of low-grade gliomas at the capillary level: a PET study on tumor blood flow and amino acid uptake. J Nucl Med 2007,48(7):1047–52. 10.2967/jnumed.106.038489

    Article  CAS  PubMed  Google Scholar 

  6. Hattori N, Tamaki N, Kudoh T, Masuda I, Magata Y, Kitano H, Inubushi M, Tadamura E, Nakao K, Konishi J: Abnormality of myocardial oxidative metabolism in noninsulin-dependent diabetes mellitus. J Nucl Med 1998,39(11):1835–40.

    CAS  PubMed  Google Scholar 

  7. Inglese E, Leva L, Matheoud R, Sacchetti G, Secco C, Gandolfo P, Brambilla M, Sambuceti G: Spatial and temporal heterogeneity of regional myocardial uptake in patients without heart disease under fasting conditions on repeated whole-body 18 F-FDG PET/CT. J Nucl Med 2007,48(10):1662–9. 10.2967/jnumed.107.041574

    Article  PubMed  Google Scholar 

  8. Nowak B, Sinha AM, Schaefer WM, Koch KC, Kaiser HJ, Hanrath P, Buell U, Stellbrink C: Cardiac resynchronization therapy homogenizes myocardial glucose metabolism and perfusion in dilated cardiomyopathy and left bundle branch block. J Am Coll Cardiol 2003,41(9):1523–8. 10.1016/S0735-1097(03)00257-2

    Article  PubMed  Google Scholar 

  9. Schindler TH, Facta AD, Prior JO, Campisi R, Inubushi M, Kreissl MC, Zhang XL, Sayre J, Dahlbom M, Schelbert HR: PET-measured heterogeneity in longitudinal myocardial blood flow in response to sympathetic and pharmacologic stress as a non-invasive probe of epicardial vasomotor dysfunction. Eur J Nucl Med Mol Imaging 2006,33(10):1140–9. 10.1007/s00259-006-0069-7

    Article  PubMed  Google Scholar 

  10. Shiba N, Kagaya Y, Ishide N, Otani H, Takeyama D, Yamane Y, Chida M, Ikeda J, Ido T, Shirato K: Heterogeneity of myocardial fluoro-18 2-deoxyglucose uptake in patients with apical hypertrophic cardiomyopathy. Jpn Circ J 1997,61(3):223–30. 10.1253/jcj.61.223

    Article  CAS  PubMed  Google Scholar 

  11. Volkow ND, Zhu W, Felder CA, Mueller K, Welsh TF, Wang GJ, de Leon MJ: Changes in brain functional homogeneity in subjects with Alzheimer's disease. Psychiatry Res 2002,114(1):39–50. 10.1016/S0925-4927(01)00130-5

    Article  PubMed  Google Scholar 

  12. Laliberte JF, Meunier J, Mignotte M, Soucy JP: Detection of diffuse abnormal perfusion in SPECT using a normal brain atlas. Neuroimage 2004,23(2):561–8. 10.1016/j.neuroimage.2004.06.029

    Article  PubMed  Google Scholar 

  13. Mustonen T, Koivisto T, Vanninen E, Vanninen R, Kuikka JT: Cerebral perfusion heterogeneity and complexity in patients with acute subarachnoid haemorrhage. Nucl Med Commun 2006,27(2):157–64. 10.1097/01.mnm.0000194399.04820.31

    Article  PubMed  Google Scholar 

  14. Vanninen E, Kuikka JT, Aikia M, Kononen M, Vanninen R: Heterogeneity of cerebral blood flow in symptomatic patients undergoing carotid endarterectomy. Nucl Med Commun 2003,24(8):893–900.

    CAS  PubMed  Google Scholar 

  15. Nagao M, Sugawara Y, Ikeda M, Fukuhara R, Hokoishi K, Murase K, Mochizuki T, Miki H, Kikuchi T: Heterogeneity of cerebral blood flow in frontotemporal lobar degeneration and Alzheimer's disease. Eur J Nucl Med Mol Imaging 2004,31(2):162–8. 10.1007/s00259-003-1343-6

    Article  PubMed  Google Scholar 

  16. Vera P, Rohrlich P, Stievenart JL, Elmaleh M, Duval M, Bonnin F, Bok B, Vilmer E: Contribution of single-photon emission computed tomography in the diagnosis and follow-up of CNS toxicity of a cytarabine-containing regimen in pediatric leukemia. J Clin Oncol 1999,17(9):2804–10.

    CAS  PubMed  Google Scholar 

  17. Kuikka JT, Tiihonen J, Bergstrom KA, Karhu J, Rasanen P, Eronen M: Abnormal structure of human striatal dopamine re-uptake sites in habitually violent alcoholic offenders: a fractal analysis. Neurosci Lett 1998,253(3):195–7. 10.1016/S0304-3940(98)00640-5

    Article  CAS  PubMed  Google Scholar 

  18. Castellano G, Bonilha L, Li LM, Cendes F: Texture analysis of medical images. Clin Radiol 2004,59(12):1061–9. 10.1016/j.crad.2004.07.008

    Article  CAS  PubMed  Google Scholar 

  19. Chabat F, Yang GZ, Hansell DM: Obstructive lung diseases: texture classification for differentiation at CT. Radiology 2003,228(3):871–7. 10.1148/radiol.2283020505

    Article  PubMed  Google Scholar 

  20. Chen W, Giger ML, Li H, Bick U, Newstead GM: Volumetric texture analysis of breast lesions on contrast-enhanced magnetic resonance images. Magn Reson Med 2007,58(3):562–71. 10.1002/mrm.21347

    Article  PubMed  Google Scholar 

  21. Rolland Y, Bézy-Wendling J, Gestin H, Bruno A, Duvauferrier R, Morcet N, Collorec R, Coatrieux JL: Analysis of texture in medical imaging. Review of the literature. Ann Radiol (Paris) 1995,38(6):315–47.

    CAS  Google Scholar 

  22. Wiltgen M, Gerger A, Wagner C, Bergthaler P, Smolle J: Evaluation of texture features in spatial and frequency domain for automatic discrimination of histologic tissue. Anal Quant Cytol Histol 2007,29(4):251–63.

    PubMed  Google Scholar 

  23. Sensakovic WF, Starkey A, Armato SG 3rd: Two-dimensional extrapolation methods for texture analysis on CT scans. Med Phys 2007,34(9):3465–72. 10.1118/1.2760307

    Article  PubMed  Google Scholar 

  24. Backes AR, GonÁalves WN, Martinez AS, Bruno OM: Texture analysis and classification using deterministic tourist walk. Pattern Recognition 2010,43(3):685–94. 10.1016/j.patcog.2009.07.017

    Article  Google Scholar 

  25. Backes AR, Martinez AS, Bruno OM: Texture analysis based on maximum contrast walker. Pattern Recognit Lett 2010,31(12):1701–7. 10.1016/j.patrec.2010.05.022

    Article  Google Scholar 

  26. Backes AR, Martinez AS, Bruno OM: Texture analysis using graphs generated by deterministic partially self-avoiding walks. Pattern Recognition 2011,44(8):1684–9. 10.1016/j.patcog.2011.01.018

    Article  Google Scholar 

  27. Modzelewski R, de la Rue T, Janvresse E, Hitzel A, Menard JF, Manrique A, Gardin I, Gerardin E, Hannequin D, Vera P: Development and validation of the random walk algorithm: application to the classification of diffuse heterogeneity in brain SPECT perfusion images. J Comput Assist Tomogr 2008,32(4):651–9. 10.1097/RCT.0b013e31814fae48

    Article  PubMed  Google Scholar 

  28. Modzelewski R, Janvresse E, de la Rue T, Vera P: Brain perfusion heterogeneity measurement based on Random Walk algorithm: choice and influence of inner parameters. Comput Med Imaging Graph 2010,34(4):289–97. 10.1016/j.compmedimag.2009.11.006

    Article  PubMed  Google Scholar 

  29. Gualdi GF, Colaiacomo MC, Bertini L, Melone A, Rojas M, Di Biasi C: Neuroimaging of Alzheimer disease: current role and future potential. Clin Ter 2004,155(10):429–38.

    CAS  PubMed  Google Scholar 

  30. Van Heertum RL, Tikofsky RS: Positron emission tomography and single-photon emission computed tomography brain imaging in the evaluation of dementia. Semin Nucl Med 2003,33(1):77–85. 10.1053/snuc.2003.127294

    Article  PubMed  Google Scholar 

  31. Camargo EE: Brain SPECT in neurology and psychiatry. J Nucl Med 2001,42(4):611–23.

    CAS  PubMed  Google Scholar 

  32. Richardson MP: Functional imaging in epilepsy. Seizure 2002,11(Suppl A):139–56.

    PubMed  Google Scholar 

  33. Gora M, Dobrzynska L, Koseda-Dragan M: Cerebral strokes in patients with systemic lupus erythematosus. Pol Merkur Lekarski 2003,15(87):235–8.

    PubMed  Google Scholar 

  34. Lass P, Koseda M, Romanowicz G, Mechlińska J, Banecka Z, Góra M, Hebanowski M, Stepień E, Łycza : Cerebral blood flow assessed by brain SPECT with 99mTc-HMPAO utilising the acetazolamide test in systemic lupus erythematosus. Nucl Med Rev Cent East Eur 1998,1(1):20–4.

    CAS  PubMed  Google Scholar 

  35. Nagamachi S, Jinnouchi S, Nishii R, Ishida Y, Fujita S, Futami S, Kodama T, Tamura S, Kawai K: Cerebral blood flow abnormalities induced by transient hypothyroidism after thyroidectomy–analysis by tc-99 m-HMPAO and SPM96. Ann Nucl Med 2004,18(6):469–77. 10.1007/BF02984562

    Article  PubMed  Google Scholar 

  36. Janson ET, Kalkner KM, Eriksson B, Westlin JE, Oberg K: Somatostatin receptor scintigraphy during treatment with lanreotide in patients with neuroendocrine tumors. Nucl Med Biol 1999,26(8):877–82. 10.1016/S0969-8051(99)00059-1

    Article  CAS  PubMed  Google Scholar 

  37. Denays R, Makhoul E, Dachy B, Tondeur M, Noel P, Ham HR, Mols P: Electroencephalographic mapping and 99mTc HMPAO single-photon emission computed tomography in carbon monoxide poisoning. Ann Emerg Med 1994,24(5):947–52. 10.1016/S0196-0644(94)70212-8

    Article  CAS  PubMed  Google Scholar 

  38. Lill DW, Mountz JM, Darji JT: Technetium-99 m-HMPAO brain SPECT evaluation of neurotoxicity due to manganese toxicity. J Nucl Med 1994,35(5):863–6.

    CAS  PubMed  Google Scholar 

  39. Wallace EA, Wisniewski G, Zubal G, VanDyck CH, Pfau SE, Smith EO, Rosen MI, Sullivan MC, Woods SW, Kosten TR: Acute cocaine effects on absolute cerebral blood flow. Psychopharmacology (Berl) 1996,128(1):17–20. 10.1007/s002130050104

    Article  CAS  Google Scholar 

  40. Turner MJ, Blackledge JM, Andrews PR: Fractal geometry in digital imaging. Academic 1998.

    Google Scholar 

  41. Berndt-Schreiber M: Fractal based approaches to morphological analysis of fundus eye images. Computer Recognition Systems 2005, 477–484.

    Chapter  Google Scholar 

  42. Calculation of fractal dimension of a 3D volume using fft. 2005. computer program].

  43. Haralick RM: Statistical and structural approches to texture. Proc IEEE 1979,67(5):786–804.

    Article  Google Scholar 

  44. Lerski RA, Straughan K, Schad LR, Boyce D, Bluml S, Zuna I: MR image texture analysis–an approach to tissue characterization. Magn Reson Imaging 1993,11(6):873–887. 10.1016/0730-725X(93)90205-R

    Article  CAS  PubMed  Google Scholar 

  45. Zubal IG, Harrell CR, Smith EO, Rattner Z, Gindi G, Hoffer PB: Computerized three-dimensional segmented human anatomy. Med Phys 1994,21(2):299–302.

    Article  CAS  PubMed  Google Scholar 

  46. Richards M, Touchon J, Ledesert B, Richie K: Cognitive decline in ageing: are AAMI and AACD distinct entities? Int J Geriatr Psychiatry 1999,14(7):534–540. 10.1002/(SICI)1099-1166(199907)14:7<534::AID-GPS963>3.0.CO;2-B

    Article  CAS  PubMed  Google Scholar 

  47. Catafau AM, Lomeña FJ, Pavia J, Parellada E, Bernardo M, Setoain J, Tolosa E: Regional cerebral blood flow pattern in normal young and aged volunteers: a 99mTc-HMPAO SPET study. Eur J Nucl Med 1996,23(10):1329–1337. 10.1007/BF01367588

    Article  CAS  PubMed  Google Scholar 

  48. Soriani-Lefevre MH, Hannequin D, Bakchine S, Ménard JF, Manrique A, Hitzel A, Kotzki PO, Boudousq V, Vera P: Evidence of bilateral temporal lobe involvement in primary progressive aphasia: a SPECT study. J Nucl Med 2003,44(7):1013–1022.

    PubMed  Google Scholar 

  49. Vera P, De Dreuille O, Bendriem B: Advantage of fan beam collimators for contrast recovery of hyperfixation in clinical SPECT. IEEE Trans Nucl Med 1997, 44: 83–89. 10.1109/23.554830

    Article  Google Scholar 

  50. Friston K, Ashburner J, Poline J, Frith C, Heather J, Frackowiak R: Spatial registration and normalization of images. HumBrainMap 1995, 2: 165–189.

    Google Scholar 

  51. Aubert-Broche B, Grova C, Jannin P, Buvat I, Benali H, Gibaud B: Detection of inter-hemispheric asymmetries of brain perfusion in SPECT. Phys Med Biol 2003,48(11):1505–1517. 10.1088/0031-9155/48/11/302

    Article  CAS  PubMed  Google Scholar 

  52. Grova C, Jannin P, Biraben A, Buvat I, Benali H, Bernard AM, Scarabin JM, Gibaud B: A methodology for generating normal and pathological brain perfusion SPECT images for evaluation of MRI/SPECT fusion methods: application in epilepsy. Phys Med Biol 2003,48(24):4023–4043. 10.1088/0031-9155/48/24/003

    Article  CAS  PubMed  Google Scholar 

Download references


This work was partially supported by grants from the ‘Comité de Seine-Maritime de la Ligue Contre le Cancer.’

Author information

Authors and Affiliations


Corresponding author

Correspondence to Romain Modzelewski.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

TL and EJ participated in the design of the RW algorithm. RM drafted the manuscript, designed the phantom and patient study and performed the statistical analysis. PV conceived of the study, and participated in its design and coordination. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Modzelewski, R., Janvresse, E., de la Rue, T. et al. Comparison of heterogeneity quantification algorithms for brain SPECT perfusion images. EJNMMI Res 2, 40 (2012).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: