Search in:

Applied Artificial Intelligence

An International Journal

Volume 31, 2017 - Issue 3

Submit an article Journal homepage

Free access

915

Views

CrossRef citations to date

Altmetric

Listen

Articles

Image Thresholding Improved by Global Optimization Methods

Felipe BalabanianInstitute of Computing, University of Campinas, Campinas, SP, BrazilView further author information

Eduardo Sant'Ana da SilvaInstitute of Computing, University of Campinas, Campinas, SP, BrazilView further author information

Helio PedriniInstitute of Computing, University of Campinas, Campinas, SP, BrazilCorrespondence[email protected]

http://orcid.org/0000-0003-0125-630X View further author information

Pages 197-208 | Published online: 10 Apr 2017

Cite this article
https://doi.org/10.1080/08839514.2017.1300050
CrossMark

In this article

ABSTRACT
Introduction
Background
Proposed methodology
Experimental results
Conclusions
Additional information
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

ABSTRACT

Image thresholding is a common segmentation technique with applications in various fields, such as computer vision, pattern recognition, microscopy, remote sensing, and biology. The selection of threshold values for segmenting pixels into foreground and background regions is usually based on subjective assumptions or user judgments under empirical rules or manually determined. This work describes and evaluates six effective threshold selection strategies for image segmentation based on global optimization methods: genetic algorithms, particle swarm, simulated annealing, and pattern search. Experiments are conducted on several images to demonstrate the effectiveness of the proposed methodology.

Introduction

Image segmentation (Bhanu and Lee Citation2012; Gonzalez and Woods Citation2010; Rosenfeld Citation2013; Schwartz and Pedrini Citation2006; Silva et al. Citation2008) plays an important role in several image processing and computer vision tasks. Thresholding is one of the simplest techniques for extracting objects in an image from its background. Thresholding techniques (Ali, Ahn, and Pant Citation2014; Beauchemin Citation2013; Sezgin Citation2004; Singla and Patra Citation2015) can be classified into bilevel and multilevel categories. In bilevel thresholding (Zou et al. Citation2012; Zou, Liu, and Zhang Citation2013), a threshold value is determined to segment the image into two regions, one corresponding to the object in the image and another representing the background. In multilevel thresholding (Ali, Ahn, and Pant Citation2014; Sarkar, Das, and Chaudhuri Citation2015) more than one threshold value is determined to segment the image into background and objects present in the image.

Several image thresholding approaches (Dirami et al. Citation2013; Kapur, Sahoo, and Wong Citation1985; Otsu Citation1979; Weszka, Nagel, and Rosenfeld Citation1974; Ye et al. Citation2012) have been proposed in the literature. Many of them (Otsu Citation1979; Weszka, Nagel, and Rosenfeld Citation1974) are based on shape information of the histogram to select threshold values. In ideal cases, the threshold can be selected at the bottom of a valley between two peaks representing objects and background, respectively. However, it is often difficult to detect valleys precisely in real images. Other methods attempt to optimize an objective function, such entropy maximization (Kapur, Sahoo, and Wong Citation1985), Bayesian error minimization (Ye and Danielsson Citation1988), Bayesian error minimization (Ye and Danielsson Citation1988), and intra-class variance minimization (Otsu Citation1979).

One of the main difficulties related to the use of complex segmentation algorithms is that they have several parameters that are usually defined empirically by users, which affects the quality of segmentation and can vary significantly for different problems.

Since the threshold values defined in a segmentation process can considerably influence the detection and location of objects present in the image, it is important to have a robust method for selecting the best thresholds. In this paper, the parameters used in a number of different image thresholding techniques are automatically defined through four global optimization methods (Floudas Citation2013; Horst and Tuy Citation2013).

The main contributions of the work include the definition of an effective fitness function to evaluate the candidate solutions based on an image quality measure and the automatic selection of appropriate image thresholds through optimization approaches.

The remainder of the paper is organized as follows. “Background” section briefly describes concepts and works related to image thresholding techniques. The proposed method for selecting the best parameters in a number of thresholding methods based on optimization methods is presented in “Proposed methodology ” section. Experimental results are described and discussed in “Experimental results” section. “Conclusions” section presents the conclusions and directions for future work.

Background

The main purpose of thresholding (Beauchemin Citation2013; Dirami et al. Citation2013; Gonzalez and Woods Citation2010; Ye et al. Citation2012) is to extract pixels belonging to objects from an image background. Image thresholding techniques are commonly categorized into global and local approaches (Glasbey Citation1993; Trier and Jain Citation1995). In global thresholding, a single threshold value T is used for all the image pixels, which is suitable for cases where the pixel values corresponding to object components and background are consistent along the entire image. In local thresholding, a different threshold T is adaptively selected for each pixel according to local image characteristics, which is more suitable to accommodate non-uniform lighting conditions in the image.

Global thresholding

The simplest thresholding is to select an intensity value as a threshold, such that the pixel values below this threshold become 0 (black) and the values above this threshold become 1 (white). If T is the global threshold of the original image , then

(1)

A widely used global thresholding approach was developed by Otsu (Otsu Citation1979), which automatically clusters the image pixels into two classes, background and foreground. The method minimizes the intra-class variance (the variance within the class) or, conversely, maximizes the inter-class variance (the variance between the classes).

Considering the range of intensity levels, the intra-class variance can be defined as the weighted sum of the variances of each class

(2)

where and . Value is the variance of the pixels in the background (below the threshold), whereas is the variance of the pixels in the foreground (above the threshold).

The inter-class variance can be defined as

(3)

where and

Local thresholding

Local thresholding techniques are designed to overcome the limitations found in the global thresholding due to changes in lighting conditions or presence of local shadows.

Bernsen’s method (Bernsen Citation1986) calculates the local threshold value based on the mean value of the minimum and maximum intensities of pixels within a window, defined as

(4)

where and are the minimum and maximum intensity values, respectively, within an region centered at .

Niblack’s method (Niblack Citation1986) calculates the threshold value for each pixel based on the local mean and standard deviation, expressed as

(5)

where and are local mean and standard deviation, respectively, within an region of pixel . The region size should be small enough to preserve local details of the image, but also large enough to suppress noise. The value k is used to adjust the fraction of total pixels that belong to foreground object.

Sauvola and Pietaksinen’s method (Sauvola and Pietaksinen Citation2000) calculates the threshold value for each pixel as

(6)

where and are local mean and standard deviation, respectively, within an region of pixel . Sauvola and Pietaksinen suggested the values and .

Phansalskar, More, and Sabale’s method (Phansalskar, More, and Sabale Citation2011) is variation to Sauvola and Pietaksinen’s method (Sauvola and Pietaksinen Citation2000) to deal with low-contrast images. The local threshold is calculated as

(7)

where local mean and standard deviation, respectively, within an region of pixel . They suggested the values , , , and .

The mean method (Jain, Kasturi, and Schunck Citation1995) selects the threshold as the mean of the local intensity distribution in an region. If the intensity of a pixel is above the mean of its neighborhood, then the pixel is set to object; otherwise, it is set as background.

In a similar way, the median method (Jain, Kasturi, and Schunck Citation1995) selects the threshold as the median of the local intensity distribution in an region. If the intensity of a pixel is above the median of its neighborhood, then the pixel is set to object; otherwise, it is set as background.

Proposed methodology

To find the best parameter values for the image thresholding problem, we model a fitness function to guide the selection of candidate solutions. The Structural Similarity Index (SSIM) (Wang et al. Citation2004) is a quality measure calculated between two images f and g is expressed as

(8)

where is the mean of f, is the mean of g, is the variance of f, is the variance of g, and is the covariance of f and g. The SSIM measure is in the range of , such that closer it is to 1, the more similar the images f and g are.

The fitness function is defined as

(9)

where is the set of all possible solutions. Therefore, our goal is to maximize the quality measure computed between an input image f and an estimated image g.

In our work, four global optimization approaches (Arora Citation2015; Chong and Zak Citation2013) are used to generate the estimated image g by automatically selecting the best parameter values employed in the image thresholding techniques described in the previous section.

Genetic Algorithms (GAs) (Holland Citation1975; Koza Citation1992) are heuristic search algorithms based on evolutionary strategies of natural selection, which can be used to generate solutions to search and optimization problems. First, an initial population is created with a number of individuals. Each individual corresponds to a possible solution, that is, it represents a set of possible values for the thresholding parameters. Each gene represents a specific parameter encoded in a bit string. The parameters values for the initial set of solutions are randomly generated. A solution (individual) is evaluated through the fitness function [Equation (8)], where the value resulting from the segmentation is compared to a reference. During the optimization process, the best solutions are selected and new solutions are created from them (reproduction). For each generation, new individuals are created via crossover to compose the next generation. The mutation operator helps prevent local optimum. The evolutionary process continues until a predefined number of generations is reached, such that the segmentation parameters (genes) of the best solution found in the last generation correspond to the final result.

The Particle Swarm Optimization (PSO) (Clerc Citation2010; Kennedy and Eberhart Citation1995) algorithm starts with a random population (called swarm) of candidate solutions (called particles), each one having the parameters to be optimized. At each iteration, every particle adjusts its velocity vector and the best known position of the swarm is updated according to the fitness function [Equation (8)]. The algorithm stores and progressively replaces the best parameters of each particle, as well as the particle that best fits the parameters. The process continues until a predefined number of iterations is performed.

The Simulated Annealing (SA) (Brooks and Morgan Citation1995; Dowsland and Thompson Citation2012) method models the physical process of heating a material and then slowly decreasing the temperature to reduce defects, thus minimizing the system energy. The algorithm starts with a randomly chosen initial solution. At each iteration, a new point is randomly generated. The distance of the new point from the current point is based on a probability distribution with a scale proportional to the temperature. The algorithm accepts not only new points that produce a better solution, but also points that generate a worse solution. This reheating mechanism helps prevent local optimum, such that other possible solutions can be explored globally. As the algorithm proceeds, a cooling schedule (Kirkpatrick, Gelatt, and Vecchi Citation1983) is selected to systematically decrease the temperature, reducing the extent of its search to converge to an optimum.

Pattern Search (PS) (Hooke and Jeeves Citation1961) is a numerical optimization approach that does not require derivatives (gradient) of the fitness function to be optimized. A pattern is a finite set of points (vectors) used for the algorithm to determine which points to search at each iteration. Therefore, the method searches a set of points around the current point to find that one where the value of the fitness function generates a better result than the value at the current point. The algorithm stops when a predefined number of iterations is performed, the variation in the fitness function from one iteration to the next is below a tolerance, or the distance between the point found at one successful poll and the point found at the next one is below a tolerance.

Experimental results

The algorithms proposed in our work were implemented in MATLAB under Linux operating system. Experiments conducted on several images demonstrated the effectiveness of the image thresholding techniques improved by the optimization strategies. Due to space limitations, results for only few images are reported here.

reports some results obtained with the optimization methods for selecting the parameters for image thresholding. Values for the computed SSIM measure, percentage of foreground pixels (black), and estimated parameters are shown for each image thresholding algorithm. The size n for the neighborhood region used in the algorithms ranged from 1 to 30.

For the GA, the population size was defined as 50, the crossover probability was 0.8, the mutation probability was 0.05, and the number of generations G was 10. For the PSO, the population size was defined as 50 and the inertia range was set from 0.1 to 1.1. In the SA algorithm, the initial temperature was set to 100. In the PS algorithm, the initial mesh size was set to 1.0 to perform a local search in a neighborhood around the mesh points, whereas the tolerance on variable was defined as 0.000001 to interrupt the iterations if both the change in position and the mesh size are less than such tolerance.

Images generated with the application of the best estimated parameters are shown in . It is possible to observe that the fitness function was able to produce adequate qualitative segmentation results. Moreover, the SSIM measures achieved through the image thresholding methods are higher when compared to those obtained with traditional global thresholding Otsu’s method (Otsu Citation1979).

Figure 1. Images resulting from the application of the best estimated parameters to each thresholding method. The values of SSIM are provided for each image by taking the original one as reference.

It is worth mentioning that the PS optimization method, as it can be observed from , provided the best qualitative results in relation to the other three methods.

Table 1. Comparison of the results obtained with four different optimization methods.

Download CSV Display Table

Conclusions

The selection of threshold values for image segmentation is typically performed manually based on empirical assumptions; however, this process may significantly affect the quality of the final result.

This work presented and discussed the application of four global optimization methods (GAs, particle swarm, SA, and PS) to six image thresholding techniques.

The automatic threshold values estimated by the algorithms, according to a fitness function based on image quality, were able to generate highly promising results. The proposed method can be used as a preprocessing step in other tasks, such as object location and image interpretation.

As directions for future work, we intend to investigate the improvement of other image processing techniques, such as filtering, registration, and compression.

Funding

The authors are thankful to the São Paulo Research Foundation (FAPESP grant #2015/12228-1) and the Brazilian National Council for Scientific and Technological Development (CNPq grant #305169/2015-7) for their financial support.

Additional information

Funding

References

Ali, M., C. W. Ahn, and M. Pant. 2014. Multi-level image thresholding by synergetic differential evolution. Applied Soft Computing 17 (April):1–11. doi:10.1016/j.asoc.2013.11.018.
Web of Science ®Google Scholar
Arora, R. K. 2015. Optimization: Algorithms and applications. Chapman and Hall/CRC, Taylor & Francis Group, CRC Press Boca Raton, FL, USA.
Google Scholar
Beauchemin, M. 2013. Image thresholding based on semivariance. Pattern Recognition Letters 34 (5):456–62. doi:10.1016/j.patrec.2012.11.017.
Web of Science ®Google Scholar
Bernsen, J. 1986. Dynamic thresholding of grey-level images. Proceedings of the 6th International Conference on Pattern Recognition, Berlin, Germany, 1251–55, October.
Google Scholar
Bhanu, B., and S. Lee. 2012. Genetic learning for adaptive image segmentation, Vol. 287. Springer Science & Business Media, Kluwer Academic Publishers New York, NY, USA.
Google Scholar
Brooks, S. P., and B. J. T. Morgan. 1995. Optimization using simulated annealing. Journal of the Royal Statistical Society, Series D (The Statistician) 44 (2):241–57.
Google Scholar
Chong, E. K. P., and S. H. Zak. 2013. An introduction to optimization. Hoboken, New Jersey, USA: John Wiley & Sons.
Google Scholar
Clerc, M. 2010. Particle swarm optimization. Newport Beach, CA, USA: John Wiley & Sons.
Google Scholar
Dirami, A., K. Hammouche, M. Diaf, and P. Siarry. 2013. Fast multilevel thresholding for image segmentation through a multiphase level set method. Signal Processing 93 (1):139–53. doi:10.1016/j.sigpro.2012.07.010.
Web of Science ®Google Scholar
Dowsland, K. A., and J. M. Thompson. 2012. Simulated annealing. In Handbook of natural computing, Rozenberg, Grzegorz, Bäck, Thomas, Kok, Joost N., Eds., 1623–55. Heidelberg Dordrecht, London, New York: Springer.
Google Scholar
Floudas, C. A. 2013. Deterministic global optimization: Theory, methods and applications, Vol. 37. Springer-Verlag New York, Inc. Secaucus, NJ, USA.
Google Scholar
Glasbey, C. A. 1993. An analysis of histogram-based thresholding algorithms. CVGIP: Graphical Models and Image Processing 55 (6):532–37.
Google Scholar
Gonzalez, R., and R. Woods. 2010. Digital image processing using matlab. India: McGraw Hill Education.
Google Scholar
Holland, J. H. 1975. Adaptation in natural and artificial system: An introduction with application to biology, control and artificial intelligence. Ann Arbor, MI, USA: University of Michigan Press.
Google Scholar
Hooke, R., and T. A. Jeeves. 1961. ”Direct Search” solution of numerical and statistical problems. Journal of the ACM 8 (2):212–29. doi:10.1145/321062.321069.
Google Scholar
Horst, R., and H. Tuy. 2013. Global optimization: Deterministic approaches. Heidelberg, Berlin: Springer Science & Business Media.
Google Scholar
Jain, R., R. Kasturi, and B. G. Schunck. 1995. Machine vision. New York, NY, USA: McGraw-Hill.
Google Scholar
Kapur, J. N., P. K. Sahoo, and A. K. C. Wong. 1985. A new method for gray-level picture thresholding using the entropy of the histogram. Computer Vision, Graphics, and Image Processing 29 (3):273–85. doi:10.1016/0734-189X(85)90125-2.
Google Scholar
Kennedy, J., and R. Eberhart. 1995. Particle swarm optimization. IEEE International Conference on Neural Networks, Perth, Australia, 1942–48.
Google Scholar
Kirkpatrick, S., C. Gelatt, and M. P. Vecchi. 1983. Optimization by simulated annealing. Science 220 (4598):671–80. doi:10.1126/science.220.4598.671.
PubMed Web of Science ®Google Scholar
Koza, J. R. 1992. Genetic programming: On the programming of computers by means of natural selection. Cambridge, MA, USA: MIT press.
Google Scholar
Niblack, W. 1986. An introduction to digital image processing. Englewood Cliffs, NJ, USA: Prentice Hall.
Google Scholar
Otsu, N. 1979. A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man and Cybernetics 9 (1, January):62–66. doi:10.1109/TSMC.1979.4310076.
Web of Science ®Google Scholar
Phansalskar, N., S. More, and A. Sabale. 2011. Adaptive local thresholding for detection of nuclei in diversity stained cytology images. In Proceedings of the International Conference on Communications and Signal Processing, Kerala, India, 218–20.
Google Scholar
Rosenfeld, A. 2013. Multiresolution image processing and analysis, Vol. 12. Springer-Verlag Berlin Heidelberg New York Tokyo: Springer Science & Business Media.
Google Scholar
Sarkar, S., S. Das, and S. S. Chaudhuri. 2015. A multilevel color image thresholding scheme based on minimum cross entropy and differential evolution. Pattern Recognition Letters 54:27–35. doi:10.1016/j.patrec.2014.11.009.
Web of Science ®Google Scholar
Sauvola, J., and M. Pietaksinen. 2000. Adaptive document image binarization. Pattern Recognition 33:225–36. doi:10.1016/S0031-3203(99)00055-2.
Web of Science ®Google Scholar
Schwartz, W. R., and H. Pedrini. 2006. Textured image segmentation based on spatial dependence using a markov random field model. In the Proceedings of the IEEE International Conference on Image Processing. Atlanta, GA, USA: IEEE, 2449–52.
Google Scholar
Sezgin, M. 2004. Survey over image thresholding techniques and quantitative performance evaluation. Journal of Electronic Imaging 13 (1):146–68. doi:10.1117/1.1631315.
Web of Science ®Google Scholar
Silva, R. D., R. Minetto, W. R. Schwartz, and H. Pedrini. 2008. Satellite image segmentation using wavelet transforms based on color and texture teatures. In the Proceedings of the International Symposium on Visual Computing, Las Vegas, NV, USA: Springer, 113–22.
Google Scholar
Singla, A., and S. Patra. 2015. A context sensitive thresholding technique for automatic image segmentation. In Computational intelligence in data mining, eds. L. C. Jain, H. S. Behera, J. K. Mandal, and D. P. Mohapatra, Volume 32 of Smart Innovation, Systems and Technologies, 19–25. New Delhi, India: Springer.
Google Scholar
Trier, B. D., and A. K. Jain. 1995. Goal-directed evaluation of binarization methods. IEEE Transactions on Pattern Analysis and Machine Intelligence 17 (12):1191–201. doi:10.1109/34.476511.
Web of Science ®Google Scholar
Wang, Z., A. C. Bovik, H. R. Sheikh, and E. P. Simoncelli. 2004. Image quality assessment: From error visibility to structural similarity. IEEE Transcations on Image Processing 13 (4):600–12. doi:10.1109/TIP.2003.819861.
PubMed Web of Science ®Google Scholar
Weszka, J. S., R. N. Nagel, and A. Rosenfeld. 1974. A threshold selection technique. IEEE Transactions on Computers C-23 (December):1322–26. doi:10.1109/T-C.1974.223858.
Web of Science ®Google Scholar
Ye, Q.-Z., and P.-E. Danielsson. 1988. On minimum error thresholding and its implementations. Pattern Recognition Letters 7 (4):201–06. doi:10.1016/0167-8655(88)90103-1.
Web of Science ®Google Scholar
Ye, Z., H. Zhengbing, X. Lai, and H. Chen. 2012. Image segmentation using thresholding and swarm intelligence. Journal of Software 7 (5):1074–82. doi:10.4304/jsw.7.5.1074-1082.
Google Scholar
Zou, Y., H. Liu, E. Song, and Z. Huang. 2012. Image bilevel thresholding based on multiscale gradient multiplication. Computers and Electrical Engineering 38 (4, July):853–61. doi:10.1016/j.compeleceng.2012.03.009.
Web of Science ®Google Scholar
Zou, Y., H. Liu, and Q. Zhang. 2013. Image bilevel thresholding based on stable transition region set. Digital Signal Processing 23 (1):126–41. doi:10.1016/j.dsp.2012.08.004.
Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Image Thresholding Improved by Global Optimization Methods

ABSTRACT

Introduction

Background

Global thresholding

Local thresholding

Proposed methodology

Experimental results

Table 1. Comparison of the results obtained with four different optimization methods.

Conclusions

Funding

References

Information for

Open access

Opportunities

Help and information

Image Thresholding Improved by Global Optimization Methods

ABSTRACT

Introduction

Background

Global thresholding

Local thresholding

Proposed methodology

Experimental results

Table 1. Comparison of the results obtained with four different optimization methods.

Conclusions

Funding

Additional information

Funding

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date