Search in:

Connection Science Volume 33, 2021 - Issue 4

Submit an article Journal homepage

Free access

544

Views

CrossRef citations to date

Altmetric

Listen

Articles

VQ-oriented data hiding based on adjustable error compensation strategy

Chin-Chen Changa Department of Information Engineering and Computer Science, Feng Chia University, Taichung, Taiwan

Ji-Hwei Horngb Department of Electronic Engineering, National Quemoy University, Kinmen, TaiwanCorrespondence[email protected]

https://orcid.org/0000-0002-2134-5257

Chia-Shou Shiha Department of Information Engineering and Computer Science, Feng Chia University, Taichung, Taiwan

Xu Wanga Department of Information Engineering and Computer Science, Feng Chia University, Taichung, Taiwan

Pages 835-853 | Received 30 Jun 2020, Accepted 09 Feb 2021, Published online: 18 Mar 2021

Cite this article
https://doi.org/10.1080/09540091.2021.1900073
CrossMark

In this article

1. Introduction
2. Related works
3. The proposed error compensation strategy
4. Experimental results
5. Conclusions
Disclosure statement
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Nowadays, we greatly depend on Internet mass transmission, of all kinds of data, including critical information among them. Therefore, secure communication is an important topic. Data hiding can embed critical information into carriers, such as images, videos, and so on. Efficiently embedding information into images is the goal of this research. Image steganography techniques utilise a cover image to hide secret data and produce a stego-image by modifying pixel values. After modification, the stego-image is distorted with respect to the cover image. Vector quantisation (VQ) is a lossy image compression technique. The image recovered from the VQ compressed code has distortion compared to the original image. Based on the VQ image, we can hide secret data by modifying the pixel value in a way that the distortion is compensated. The embedding rate of the proposed scheme is adjustable. Experimental results show that our scheme can achieve a high embedding rate in comparison with related works. For low-quality VQ images, embedding can improve the visual quality of the stego-image at the same time.

KEYWORDS:

Vector quantisation
data hiding
error compensation
codebook

1. Introduction

In this rapid developing age of the Internet, people are increasingly focusing on information security. Many methods have been proposed to secure private information (Schafer & Lilian, Citation2017). Data hiding is one such protection approach. Data hiding (Artz, Citation2001; Du & Hsu, Citation2003; Cheng & Huang, Citation2001; Yang et al., Citation2008; Lin et al., Citation2009; Khan et al., Citation2014; Chang et al., Citation2014; Chang et al., Citation2018; Chang et al., Citation2019a; Chang et al., Citation2019b) embeds important data into carriers, preventing the data from been noticed to achieve the goal of data protection. This process is different from encryption, which converts the secret information into unrecognisable data. The most commonly applied carriers for data are texts, images, and videos. Recently, the application domain has been extended to various other media, such as computer software (Alrehily & Thayananthan, Citation2018; Wang et al., Citation2018), remote sensing data (Carpentieri et al., Citation2019), and DNA microarray images (Pizzolante et al., Citation2018).

Data hiding techniques can be categorised into reversible and irreversible and are widely used in cloud applications for accessorial information embedding (Xiao et al., Citation2021), such as embedding keywords for image retrieval (Ying et al., Citation2021). The payload in reversible data hiding is very low; however, the carrier can be perfectly recovered after the secret data has been extracted. Irreversible data hiding can achieve a higher payload, but the carrier is distorted and cannot be recovered. Both methods have advantages and disadvantages. The advantage of irreversible data hiding is the high payload. Therefore, improving the payload with the lowest distortion is a major concern.

Like other block-based image compression methods, such as block truncation coding (Chen et al., Citation2020), vector quantisation (VQ) is a lossy image compression technique (Linde et al., Citation1980; Nasrabadi & King, Citation1988). VQ can be divided into three phases: codebook training, image compression, and decompression. The conventional codebook design is to leverage the Lindo-Buzo-Gray (LBG) algorithm (Linde et al., Citation1980). Researches have been proposed to accelerate the codebook design (Chang & Hu, Citation1998; Lai & Liaw, Citation1996; Lin & Tai, Citation1998) and index code search (Hu et al., Citation2008; Hu & Chang, Citation2003; Lu & Chang, Citation2009). Based on the codebook, an image can be compressed into an index table. In the decompression phase, the index table is converted back to an image according to the same codebook. Details will be discussed in Section 2.

As mentioned earlier, VQ is a lossy image compression. Method where the image converted back from the index table is not exactly the same as the original image (Xu et al., Citation2021). Hiding data into a cover image also produces a distorted stego-image. Some research works have been proposed to utilise VQ-compressed images to hide secret data. In 2019, Lee et al. published a survey of data hiding based on VQ (Lee et al., Citation2019). Secret data can be embedded into the index (Manohar & Kieu, Citation2018; Qin & Hu, Citation2016; Rahmani et al., Citation2016; Rahmani & Dastghaibyfard, Citation2018a), the codebook (Pan & Wang, Citation2018; Rahmani & Dastghaibyfard, Citation2018b), or the image reconstructed from the compressed code (Huang et al., Citation2018; Huang et al., Citation2019). In 2018, Huang et al. proposed a data hiding scheme based on the image recovered from the VQ compressed code (Huang et al., Citation2018). By storing the index, the decompressed image can be recovered after the secret data extraction. In the following year, they proposed an improved version of their scheme (Huang et al., Citation2019). By comparing with the original image, the secret data is embedded in such a way that the stego-image is closer to the original image than the input. The details are presented in Section 2.

This paper is organised as follows. The vector quantisation and Huang et al.’s (Citation2019) strategy are introduced in Section 2. The newly proposed error compensation data hiding scheme is presented in Section 3. Experimental results are given in Section 4. Discussions and conclusions are given in the last section.

2. Related works

In this section, we will introduce the VQ image compression technique and the VQ-based data hiding scheme proposed by Huang et al.

2.1. VQ image compression

VQ is a lossy image compression technique (Linde et al., Citation1980; Nasrabadi & King, Citation1988) that can be applied to compress different types of data. Here, we will apply VQ to compress gray-level images. The original image is divided into nonoverlapping blocks of size $n \times n$ first. Then, applying a codebook training algorithm to obtain $k$ different matrices, called codewords, of size such that these matrices best represent the blocks of the original image. In the compression phase, each block of the original image is represented by the index of its best fitted codeword. Thus, an image can be compressed into an index table together with a codebook. In the decompression phase, the image is recovered by tiling up the codewords according to the index table and the codebook.

Typically, the LBG algorithm proposed by Linde et al. (Linde et al., Citation1980) is utilised to obtain the codebook. The LBG algorithm is based on $k$ -means clustering and can produce satisfactory compressed images. In most image applications, the size of a codeword is $4 \times 4$ , and the length of an index is $8$ , which means the codebook contains $2^{8} = 256$ codewords.

An illustrative example is shown in Figure . The original image (a) is divided into nonoverlapping blocks. The example block at the upper left corner is shown in (b). Then, check the codebook to find the best matching, i.e. minimum square error, codeword, as shown in (c). Finally, represent the block by the index of the matched codeword.

Figure 1. An illustration of Vector Quantization.

2.2. Huang et al.’s data hiding scheme

In 2019, Huang et al. proposed a data hiding scheme based on VQ (Huang et al., Citation2019). The main idea of their scheme is to embed secret data by shrinking the errors between the image reconstructed from the VQ-compressed code and the cover image.

In their data hiding scheme, each decompressed block $W = {w_{1}, w_{2}, \dots, w_{16}}$ is compared with its corresponding original block $C = {c_{1}, c_{2}, \dots, c_{16}}$ . Secret data is then embedded in a way that each pixel value $w_{i}$ is modified with a dynamic amount of secret data $d$ such that (1) $w_{i}^{'} = {\begin{matrix} w_{i} + d, & | & w_{i} < c_{i^{'}} \\ w_{i} - d, & | & w_{i} \geq c_{i^{'}} \end{matrix}$ (1) subject to (2) $| w_{i}^{'} - c_{i} < t, |$ (2) where $t$ is a predefined threshold of error tolerance.

By saving the index $I$ of the codeword $W$ , we can extract secret data $d$ by subtraction $d = | w_{i}^{'} - w_{i} |$ . A problem arises where, since the amount of modification is dynamic, the number of embedded bits also varies from pixel to pixel. Therefore, to avoid ambiguity, each segment of data $d$ should be led with “1”. In cases of “0” leading bits, the whole segment is flipped and the flipping marker is checked, i. e. $f_{i} = 1$ .

The data structure of a stego-image block is shown in Figure , where $w_{i}^{0}$ , $w_{i}^{1}$ , and $w_{i}^{2 \sim 7}$ are the least significant bit (LSB-1), the second least significant bit (LSB-2), and the six most significant bits (MSB) of the stego-block. The blue region stores the index of the codeword, i.e. $I = w_{8}^{0} w_{7}^{0} \dots w_{1}^{0}$ ; the green region stores the flip marker, i.e. $F = w_{1}^{1} w_{2}^{1} \dots w_{8}^{1} w_{9}^{0} w_{10}^{0} \dots w_{16}^{0}$ ; and the yellow region serves as $w_{i}$ for embedding secret data.

Figure 2. The data structure of Huang et al.’s scheme (Huang et al., Citation2019).

Huang et al.’s data hiding scheme produces a stego-image of good visual quality at low payloads. However, the flipping marker bit is not worth the sacrifice in high error tolerance.

3. The proposed error compensation strategy

Based on the same framework as Huang et al.’s data hiding scheme, we propose an error compensation strategy to improve the embedding capacity of a VQ-based approach. In the first subsection, we define three types of secret data mapping tables. According to the mapping table, the binary secret bit stream can be converted to signed decimal values. In the second subsection, we propose a data hiding scheme to embed the signed decimal secret digits.

3.1. Three types of mapping tables

Before data embedding, we map the secret bits into signed decimal values. To embed an adjustable amount of secret data, three types of mapping tables are designed as shown in Table . For Type I in (a), two bits of binary data can be mapped to signed decimal values of $- 2$ to $+ 1$ ; for Type II in (b), three bits are mapped to values of $- 4$ to $+ 3$ ; for Type III in (c), four bits are mapped to values of $- 8$ to $+ 7$ . Thus, the required bits of data can be embedded with a minimum amount of absolute values with a signed bit.

Table 1. Three types of mapping tables.

Display Table

These mapping tables reveal a slight defect, where the maximum value is asymmetric for positive and negative signs. To make the mapping table symmetric, we further design an extended version as shown in Table . For Type I+ in (a), the case of “11” is split into “110” and “111”. Therefore, there are five total cases that map into a symmetric range of $- 2$ to $+ 2$ . In a similar manner, we split the last case of Types II and III and the newly generated cases are arranged in both ends of the tables as shown in (b) and (c), to produce a mapping table of Types II+ and III+. In applying the extended mapping table, the embedding rate is a variable depending on the data value.

Table 2. Three types of extended mapping tables.

Display Table

3.2. The VQ-based data embedding scheme with error compensation

The framework of the proposed scheme is similar to Huang et al.’s scheme, as introduced in Section 2.2 (Huang et al., Citation2019). The cover image is divided into non-overlapping blocks of size $4 \times 4$ first. Then, each image block is represented by an index of the VQ codebook to produce the VQ compressed image. The decompressed image can be obtained by replacing each index with its corresponding codeword. Our main idea is to compute the pixel values of the stego-image block $\hat{C} (k) = {{\hat{c}}_{1}, {\hat{c}}_{2}, \dots, {\hat{c}}_{16}}$ based on its corresponding VQ decompressed image block $W (I (k)) = {w_{1}, w_{2}, \dots, w_{16}}$ , where $I (k)$ and $W (I (k))$ are the corresponding index and codeword of the $k$ -th image block.

A stego-image block is shown in Figure , where the data structure of the proposed data hiding scheme is illustrated. The $4 \times 4$ block is divided into upper half and lower half sub-blocks. The eight bit planes of both sub-blocks are displayed in the vertical direction. The bits in the green region are used to store the block index, i.e. ${\hat{c}}_{8}^{0} {\hat{c}}_{7}^{0} \dots {\hat{c}}_{1}^{0} = I$ ; the bits in the purple region are used to store the sign bits, i.e. ${\hat{c}}_{1}^{1} {\hat{c}}_{2}^{1} \dots {\hat{c}}_{8}^{1} {\hat{c}}_{9}^{0} {\hat{c}}_{10}^{0} \dots {\hat{c}}_{16}^{0} = sign (S^{'})$ ; while the MSBs in the blue and yellow regions are applied to embed the absolute value of a decimal encoded data.

Figure 3. The data structure of a stego-image block.

Before embedding, we have to decide the embedding rate first. Since the available MSBs for the upper half and the lower half sub-blocks are not equal, different embedding rates are suggested. We take an example of embedding 2 bits per pixel in the upper half block and embedding 3 bits per pixel in the lower half block. Thus, the embedding process should refer to Type I and Type II tables, as shown in Table (a) and (b). For embedding of each pixel, the segment of secret bits $S$ is mapped to a decimal value $S^{'}$ according to the mapping table first. Then, the decimal value $S^{'}$ is embedded by Equation (3) for a pixel in the upper half sub-block. (3a) $\begin{aligned} {\hat{c}}_{i}^{1} & = sign (S_{i}^{^{'}}) \end{aligned}$ (3a) (3b) $\begin{aligned} {\hat{c}}_{i}^{2 \sim 7} & = {\begin{matrix} w_{i}^{2 \sim 7} + | S_{i}^{^{'}} |, w_{i}^{2 \sim 7} \leq c_{i}^{2 \sim 7}, \\ w_{i}^{2 \sim 7} - | S_{i}^{^{'}} |, w_{i}^{2 \sim 7} > c_{i}^{2 \sim 7} . \end{matrix} \end{aligned}$ (3b)

While the decimal value $S^{'}$ is embedded by Equation (4) for a pixel in the lower half sub-block. (4a) $\begin{aligned} {\hat{c}}_{i}^{0} & = sign (S_{i}^{^{'}}) \end{aligned}$ (4a) (4b) $\begin{aligned} {\hat{c}}_{i}^{1 \sim 7} & = {\begin{matrix} w_{i}^{1 \sim 7} + | S_{i}^{^{'}} |, w_{i}^{1 \sim 7} \leq c_{i}^{1 \sim 7}, \\ w_{i}^{1 \sim 7} - | S_{i}^{^{'}} |, w_{i}^{1 \sim 7} > c_{i}^{1 \sim 7} . \end{matrix} \end{aligned}$ (4b)

The LSBs of the upper half sub-block are reserved to record the VQ index of the current block. The data embedding algorithm with mapping tables of Type I and II is summarised as follows.

3.2.1. Data embedding algorithm with mapping table Types I and II

Input: cover image $C$ , binary secret stream, VQ codebook with size 256

Output: stego-image $\hat{C}$

Step 1:	Partition the cover image of size $W \times H$ into mutually exclusive blocks of size $4 \times 4$ , i.e. $C = {C (k) \| k = 1, 2, \dots, W / 4 \times H / 4}$ .
Step 2:	For each image block $C (k)$ , search the codebook to find the minimum square error codeword $W (I (k))$ , where $I (k)$ is its index in the codebook.
Step 3:	Obtain the decompressed block $W (I (k)) = {w_{1}, w_{2}, \dots, w_{16}}$ , its corresponding cover image block $C (k) = {c_{1}, c_{2}, \dots, c_{16}}$ , and initiate the empty stego-image block $\hat{C} (k) = {{\hat{c}}_{1}, {\hat{c}}_{2}, \dots, {\hat{c}}_{16}}$ .
Step 4:	Index $I$ is inserted to the LSB of the upper half plane by ${\hat{c}}_{8}^{0} {\hat{c}}_{7}^{0} \dots {\hat{c}}_{1}^{0} = I (k)$ .
Step 5:	For each pixel ${\hat{c}}_{i}, i = 1, 2, \dots, 8$ , do Steps 6–8.
Step 6:	Retrieve 2 bits of binary stream $S_{i}$ and map to $S_{i}^{'}$ according to mapping table Type I.
Step 7:	Store the sign of $S_{i}^{'}$ by ${\hat{c}}_{i}^{1} = sign (S_{i}^{'})$ .
Step 8:	Embed the absolute value $\| S_{i}^{'} \|$ by Equation (3).
Step 9:	For each pixel ${\hat{c}}_{i}, i = 9, 10, \dots, 16$ , do Steps 10–12.
Step 10:	Retrieve 3 bits of binary stream $S_{i}$ and map to $S_{i}^{'}$ according to mapping table Type II.
Step 11:	Store the sign of $S_{i}^{'}$ by ${\hat{c}}_{i}^{0} = sign (S_{i}^{'})$ .
Step 12:	Embed $\| S_{i}^{'} \|$ by Equation (4).
Step 13:	Go to Step 2, until all blocks are processed.

For each pixel ${\hat{c}}_{i}$ , its $j$ -th lowest bit is denoted as ${\hat{c}}_{i}^{j}$ . The truncated pixel ${\hat{c}}_{i}^{2 \sim 7}$ of length 6 is applied to embed 2 bits of secret data by referring to mapping table Type I for the upper half sub-block, while ${\hat{c}}_{i}^{1 \sim 7}$ of length 7 is applied to embed 3 bits by referring to mapping table Type II.

Due to the extra space reserved for storing the index value, the truncated pixel lengths are different for the upper and the lower half sub-blocks. To obtain evenly modified pixels throughout the image, we tend to embed more bits in the lower half sub-block. Different mapping table type combinations are possible. In applying the plus (+) series of mapping tables given in Table , the splitting occasions should be checked at Steps 6 and 10. If that happens, one additional bit should be retrieved before mapping to $S^{'}$ . As a consequence, the embedding capacity of applying the extended mapping table is slightly larger than its original version.

In Steps 8 and 12, the MSB of the processing pixel is compared to the corresponding cover image pixel with the same bit-length and is modified to always compensate the error between them. Since the step size of the compensation is determined by the secret bits, it may result in over or under compensation. However, this strategy is always better than direction modification based on the original pixel value of the cover image.

Another possible variation of the embedding scheme is to alter the codebook to a version with different number of codewords $L$ , where typical numbers other than 256 are 128 and 512. Under such circumstances, the index bit-length $n$ is determined first by $L = 2^{n}$ . At Step 4, the index insertion is given by ${\hat{c}}_{n}^{0} {\hat{c}}_{n - 1}^{0} \dots {\hat{c}}_{1}^{0} = I (k)$ ; the execution arrangement of Steps 5–8 and Steps 9–12 are altered by $i = 1, 2, \dots, n$ and $i = n + 1, n + 2, \dots, 16$ , respectively.

3.3. Data extraction

The data extraction process can be devised according to the embedding process. The stego-image is divided into blocks of size $4 \times 4$ . For each block, truncate the bit planes which store the index and the sign bits. According to the index value, get the corresponding codeword to construct the decompressed image block and truncate in the same way with the stego-block. Each truncated pixel value in the stego-image block is compared with its corresponding pixel in the decompressed block to extract the secret data. The algorithm is summarised as follows.

3.3.1. Data extraction algorithm

Input: stego-image, VQ codebook with size 256

Output: secret binary stream

Step 1:	Partition the stego-image of size $W \times H$ into mutually exclusive blocks of size $4 \times 4$ , i.e. $\hat{C} = {\hat{C} (k) \| k = 1, 2, \dots, W / 4 \times H / 4}$ .
Step 2:	For each image block $\hat{C} (k)$ , truncate the LSB to obtain $\hat{I} (k) = {\hat{c}}_{8}^{0} {\hat{c}}_{7}^{0} \dots {\hat{c}}_{1}^{0}$ and $sign (S_{1 \sim 16}^{'}) = {\hat{c}}_{1}^{1} {\hat{c}}_{2}^{1} \dots {\hat{c}}_{8}^{1} {\hat{c}}_{9}^{0} {\hat{c}}_{10}^{0} \dots {\hat{c}}_{16}^{0}$ .
Step 3:	According to $\hat{I} (k)$ , find the corresponding codeword $W (\hat{I} (k)) = {w_{1}, w_{2}, \dots, w_{16}}$ .
Step 4:	For each pixel ${\hat{c}}_{i}, i = 1, 2, \dots, 8$ , compute $\| S_{i}^{'} \| = \| {\hat{c}}_{i}^{2 \sim 7} - w_{i}^{2 \sim 7} \|$ ; then, refering to the Type I table, combine $sign (S_{i}^{'})$ and $\| S_{i}^{'} \|$ to map the binary secret bits $S_{i}$ and record to an output file.
Step 5:	For each pixel ${\hat{c}}_{i}, i = 9, 10, \dots, 16$ , compute $\| S_{i}^{'} \| = \| {\hat{c}}_{i}^{1 \sim 7} - w_{i}^{1 \sim 7} \|$ ; then, refer to Type II table, combine $sign (S_{i}^{'})$ and $\| S_{i}^{'} \|$ to map the binary secret bits $S_{i}$ and record to an output file.
Step 6:	Go to Step 2, until all blocks are processed.

3.4. Example of embedding and extraction

An example of the proposed embedding scheme is shown in Figure . Figure (a) is a VQ decompressed block, and Figure (b) is the corresponding original image block. The LSB of the upper half sub-block in (a) is truncated to obtain (c), and the LSB plane of the whole block is truncated once more to get (d). The original image block (b) does repeats process to get (b’).

Figure 4. An example of data hiding.

As shown in the figure, the secret data ${10, 01, 11, 01, 00, 01, 11, 10}$ is mapped to ${+ 0, - 1, + 1, - 1, - 2, - 1, + 1, + 0}$ by referring to Type I table. The absolute values are ${0, 1, 1, 1, 2, 1, 1, 0}$ . Comparing the blue region of (d) with (b’), we add an absolute value when the corresponding pixel value in (d) is lower than (b’); subtract otherwise. The upper sub-block ${15, 14, 14, 14, 15, 15, 15, 15}$ becomes ${15 - 0, 14 + 1, 14 + 1, 14 + 1, 15 + 2, 15 + 1, 15 + 1, 15 + 0} = {15, 15, 15, 15, 17, 16, 16, 15}$ .

For the lower sub-block, the secret data ${010, 101, 110, 011, 001, 100, 111, 000}$ is first transferred to ${- 2, + 1, + 2, - 1, - 3, + 0, + 3, - 4}$ according to the Type II table. Then, compare the orange region of (e) and (b’) to embed the absolute values. The sub-block ${28, 29, 31, 30, 30, 28, 29, 32}$ is modified to become ${28 + 2, 29 + 1, 31 + 2, 30 + 1, 30 - 3, 28 + 0, 29 + 3, 32 - 4} = {30, 30, 33, 31, 27, 28, 32, 28}$ as shown in (f). Next, the sign bits ${0, 1, 0, 1, 1, 1, 0, 0; 1, 0, 0, 1, 1, 0, 0, 1}$ are inserted into the LSB plane to get (g), and the index $I = 72 = {0, 1, 0, 0, 1, 0, 0, 0}$ is inserted into the LSB of the upper sub-block to get the final stego-image block (h).

The extraction process is shown in Figure . The upper half of stego-image block (a) is truncated into (b) and obtains index $I = 72 = {0, 1, 0, 0, 1, 0, 0, 0}$ . According to the index, the corresponding codeword is obtained as shown in (c). The LSB plane of (b) is truncated again to get sign bits and result in (d). The truncated stego-image block and truncated codeword are compared to obtain the absolute values of $S^{'}$ . Combine the absolute values and sign bits to get $S^{'}$ , and the secret binary stream can be recovered by referring to the corresponding embedding mapping tables.

Figure 5. An example of data extraction.

4. Experimental results

We use an Intel® Core™ i7-4790 [email protected] GHz and 10GB RAM PC as the platform to execute our experiments. Nine gray level standard test images of size $512 \times 512$ were used in our experiments, as shown in Figure . We randomly selected the four images (b) Baboon, (f) Goldhill, (g) Lena, and (h) Peppers as the input of LBG algorithm (Linde et al., Citation1980) introduced in Section 2.1 to get codebooks of three different sizes, including 128, 256, and 512 codewords. The secret binary stream was generated by a random number generator provided by the MATLAB programming language.

Figure 6. Nine standard test images of size 512×512 applied in our experiments.

4.1. Background analysis

The proposed data hiding scheme is based on the VQ decompressed image. To understand the visual quality of the VQ decompressed image, we adopted the measure of peak signal to noise ratio (PSNR), which is defined as (5) $\begin{aligned} P S N R & = 10 lo g_{10} \frac{255^{2}}{M S E}, \end{aligned}$ (5) (6) $\begin{aligned} M S E & = \frac{1}{W \times H} \sum_{i = 1}^{H} \sum_{j = 1}^{W} (I_{i, j} - I {^{'}}_{i, j})^{2}, \end{aligned}$ (6) where $I_{i, j}$ is the original test image, $I {^{'}}_{i, j}$ is the VQ decompressed image, and $W \times H$ is the image size. The PSNR values of nine VQ decompressed images are listed in Table , where the experimental values corresponding to the three different codebook sizes are provided. It can be observed that the PSNR value degrades with a decreasing codebook size. However, the PSNR level mostly relies on the inherent characteristics of the test image. The image “Baboon” is the worst case, while the image “Zelda” is best approximated by the VQ decompressed image.

Table 3. PSNR of VQ decompressed images with different codebook sizes.

Download CSV Display Table

Based on the VQ image of three different codebook sizes, we can embed the secret data with different types of mapping tables. According to the proposed embedding algorithm, the upper half block and the lower half block are embedded with different types of mapping tables. The only basic rule is that the lower half block is capable of hiding more bits with the same visual quality degradation level. In our experiment, five mapping table combinations are applied, including (1) Type I and Type II, (2) Type I+ and Type II+, (3) Type II and Type II, (4) Type II and Type III, and (5) Type II+ and Type III+.

For the typical codebook size of 256, case (1) can hide 2 bits per pixel (bpp) in the upper half block and 3 bits per pixel in the lower half block. This results in an average embedding rate (ER) of 2.5 bits per pixel. The embedding rate of cases (3) and (4) can be derived in the same way to get 3 and 3.5 bits per pixel, respectively. Let’s now turn to the more complicated case of (2). Referring to the mapping table of Type I+ in Table (a), the four 2-bit combinations of data “00”, “01”, “10”, and “11” have equal probability of occurring. Therefore, their probabilities are all 1/4. In the special situation that “11” occur, we embedded an additional bit. The average embedding rate can be calculated by: (7) $\begin{aligned} E R_{I +} = 2 \times \frac{1}{4} + 2 \times \frac{1}{4} + 2 \times \frac{1}{4} + 3 \times \frac{1}{4} = \frac{9}{4} = 2.25 \frac{bits}{pixel} . \end{aligned}$ (7)

In the same way, we can calculate the average embedding rate of mapping table Type II+ by: (8) $\begin{aligned} E R_{II +} = \sum_{i = 1}^{7} (3 \times \frac{1}{8}) + 4 \times \frac{1}{8} = \frac{29}{8} = 3.125 \frac{bits}{pixel} . \end{aligned}$ (8)

Finally, we average the ER of two portions to get: (9) $\begin{aligned} E R_{(2)} = \frac{2.25 + 3.125}{2} = 2.69 \frac{bits}{pixel} . \end{aligned}$ (9)

The ER of case (5) can be derived in a similar way to get 3.59 bpp.

4.2. Performance evaluation

The PSNR values for different embedding rates based on the 256 codebook size are listed in Table . The corresponding figure is shown as Figure . As shown in the figure, the PSNR degrades with increasing ER. The only test image that violates this rule is “Baboon”. Since the VQ decompressed image is largely deviated from the original image as indicated by the PSNR value, the error compensation embedding strategy can shrink the deviation. Therefore, the PSNR increases after embedding the secret data. This phenomenon does not happen in the test images where the VQ decompressed image well approximates the original image. The embedding may over compensate the approximation error.

Figure 7. PSNR for different embedding rates (codebook size: 256).

Table 4. PSNR for different embedding rates (codebook size: 256).

Download CSV Display Table

Another viewpoint is that the PSNR of case (1) is very close to case (2), and case (4) is very close to case (5). This phenomenon can be explained by referring to Tables and , where the embedding of an additional bit does not alter the absolute value of the pixel value modification. The only change is the sign bit, which is located at the lower bit and has a minor effect on the image quality.

Similar phenomena can be found in the experimental data of codebooks sized 128 and 512 shown in Tables and as well as Figures and . The only difference is the slight change of ER. Due to the change of codebook size, the required index length changed. Thus, the upper half block contains 7 pixels for size 128 and 9 pixels for size 512. The ER for all cases is recalculated as listed in the tables.

Figure 8. PSNR for different embedding rates (codebook size: 128).

Figure 9. PSNR for different embedding rates (codebook size: 512).

Table 5. PSNR for different embedding rates (codebook size: 128).

Download CSV Display Table

Table 6. PSNR for different embedding rates (codebook size: 512).

Download CSV Display Table

To assess the influence of data embedding on the visual quality of the stego-images, the experimental results are shown in Figure . Two typical test images “Lena” and “Baboon”, are given for different codebook sizes with the largest ER of the experiment. For all cases, the stego-image cannot be distinguished from the original image by human vision. The effect of the codebook size on the PSNR value is shown in Figure . For each test image, the PSNR is obtained by averaging the PSNR of different ERs under the same codebook size. As shown in the figure, the PSNR increases with an increasing codebook size on average.

Figure 10. Stego-images for different codebook sizes.

Figure 11. Comparison of average PSNR for different codebook sizes.

4.3. Comparison with related works

The proposed data hiding scheme is compared with Rahmani et al.’s scheme (Rahmani et al., Citation2016), Rahmani and Dastghaibyfard’s scheme (Rahmani & Dastghaibyfard, Citation2018b), and Huang et al.’s two schemes (Huang et al., Citation2018) and (Huang et al., Citation2019) for different codebook sizes as shown in Tables and . Since secret data is embedded by modifying the index table in the first two schemes, embedding capacities for these schemes are relatively low. Besides, modification of the index value changes all pixel values in the whole reconstructed image block and thus severely degrades the visual quality of stego-image. Huang et al.’s adaptive embedding scheme can hide secret data with good PSNR performance. However, the proposed scheme can embed much more data while maintaining an acceptable visual quality of the stego-images. At very low embedding rate, our scheme suffers from insufficient error compensation and PSNR cannot be further improved.

Table 7. Comparison with related works (codebook size: 256).

Download CSV Display Table

Table 8. Comparison with related works (codebook size: 512).

Download CSV Display Table

4.4. Steganalysis

To know how the security of the proposed scheme compares against a pixel value differencing steganalysis (Zhang et al., Citation2004), the pixel value differencing histograms are given in Figure . An analysis of two typical test images, each with three VQ images of different codebook sizes, are presented. In comparison with the VQ-decompressed images, the histogram of the fully embedded stego-image is much closer to that of the cover image. The data embedding error compensation effect is very significant.

Figure 12. The PDH analysis of the stego-images and VQ-decompressed images.

The RS steganalysis (Fridrich et al., Citation2002) was also applied to assess the security of our scheme. The analysed results are shown in Table . Again, the fully embedded stego-images for three different codebook sizes of two test images are analysed. The image name “128-Baboon-stego” represents the fully embedded Baboon image with a codebook size of 128; while “256-Baboon-decom” is the VQ-decompressed Baboon image with a codebook size of 256. The experimental data indicates that the stego-images are robust under RS steganalysis.

Table 9. The RS steganalysis of stego-images and VQ-decompressed images.

Display Table

5. Conclusions

We propose a VQ-oriented data hiding scheme based on an error compensation strategy. By compensating for the error between the VQ decompressed image and the original image, the resulting stego-image shrinks the error in low embedding rates. However, it may over compensate in high embedding rates. Besides, the proposed scheme is not advantageous in comparison with the related work (Huang et al., Citation2019) at low embedding rates.

Referring to the experimental data, the stego-image is closer to the original image than the VQ decompressed version in a low PSNR case. As the codebook size is reduced to an even lower size, the proposed scheme can embed a greater amount of secret data with improved visual quality in comparison with the VQ image before data hiding.

Although the proposed VQ-oriented data hiding scheme can achieve a high embedding capacity and successfully compensate the approximation error of the VQ-decompressed image, the compensation effect is significant only when the VQ-based image largely deviates from the cover image. A more sophisticated and adaptive design is required to gain a better stego-image than the current scheme.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

Alrehily, A., & Thayananthan, V. (2018). Computer security and software watermarking based on return-oriented programming. International Journal of Computer Network and Information Security, 5(5), 28–36. https://doi.org/10.5815/ijcnis.2018.05.04
Google Scholar
Artz, D. (2001). Digital steganography: Hiding data within data. IEEE Internet Computing, 5(3), 75–80. https://doi.org/10.1109/4236.935180
Web of Science ®Google Scholar
Carpentieri, B., Castiglione, A., Santis, A. D., Palmieri, F., & Pizzolante, R. (2019). One-pass lossless data hiding and compression of remote sensing data. Future Generation Computer Systems, 90, 222–239. https://doi.org/10.1016/j.future.2018.07.051
Web of Science ®Google Scholar
Chang, C. C., & Hu, Y. C. (1998). A fast codebook training algorithm for vector quantization. IEEE Transactions on Consumer Electronics, 44(4), 1201–1208. https://doi.org/10.1109/30.735818
Web of Science ®Google Scholar
Chang, C. C., & Li, C. T. (2019b). Algebraic secret sharing using privacy homomorphisms for IoT-based healthcare systems. CDATA[Mathematical Biosciences and Engineering, 16(5), 3367–3381. https://doi.org/10.3934/mbe.2019168
PubMed Web of Science ®Google Scholar
Chang, C. C., Li, C. T., & Chen, K. (2019a). Privacy-preserving reversible information hiding based on arithmetic of quadratic residues. IEEE Access, 7, 54117–54132. https://doi.org/10.1109/ACCESS.2019.2908924
Web of Science ®Google Scholar
Chang, C. C., Li, C. T., & Shi, Y. Q. (2018). Privacy-aware reversible watermarking in cloud computing environments,”. IEEE Access, 6, 70720–70733. https://doi.org/10.1109/ACCESS.2018.2880904
Web of Science ®Google Scholar
Chang, C. C., Liu, Y., & Nguyen, T. S. (2014). A novel turtle shell based scheme for data hiding. Proceedings of 2014 International conference on Intelligent Information Hiding and Multimedia Signal processing, Kita Kyushu, Japan, 27–29 August 2014, 89–93.
Google Scholar
Chen, Y. H., Chang, C. C., & Hsu, C. Y. (2020). Content-based image retrieval using block truncation coding based on edge quantization. Connection Science, 32(4), 431–448. https://doi.org/10.1080/09540091.2020.1753174
Web of Science ®Google Scholar
Cheng, Q., & Huang, T. S. (2001). An additive approach to transform-domain information hiding and optimum detection structure. IEEE Transactions on Multimedia, 3(3), 273–284. https://doi.org/10.1109/6046.944472
Web of Science ®Google Scholar
Du, W. C., & Hsu, W. J. (2003). Adaptive data hiding based on VQ compressed images. IEE Proceedings of vision, image and Signal processing, 150(4), August, 233–238.
Google Scholar
Fridrich, J., & Goljan, M. (2002). Practical steganalysis of digital images: State of the art. Proceedings of SPIE, 4675, 1–13. https://doi.org/10.1117/12.465263
Google Scholar
Hu, Y. C., & Chang, C. C. (2003). An effective codebook search algorithm for vector quantization. The Imaging Science Journal, 51(4), 221–233. https://doi.org/10.1080/13682199.2003.11784428
Web of Science ®Google Scholar
Hu, Y. C., Su, B. H., & Tsou, C. C. (2008). Fast VQ codebook search algorithm for grayscale image coding. Image and Vision Computing, 26(5), 657–666. https://doi.org/10.1016/j.imavis.2007.08.001
Web of Science ®Google Scholar
Huang, C. T., Lin, L. C., Yang, C. H., & Wang, S. J. (2019). Dynamic embedding strategy of VQ-based information hiding approach. Journal of Visual Communication and Image Representation, 59, 14–32. https://doi.org/10.1016/j.jvcir.2018.12.018
Web of Science ®Google Scholar
Huang, C. T., Tsai, M. Y., Lin, L. C., Wang, W. J., & Wang, S. J. (2018). VQ-based data hiding in IoT networks using two-level encoding with adaptive pixel replacements. The Journal of Supercomputing, 74(9), 4295–4314. https://doi.org/10.1007/s11227-016-1874-9
Web of Science ®Google Scholar
Khan, A., Siddiqa, A., Munib, S., & Malik, S. A. (2014). A recent survey of reversible watermarking techniques. Information Sciences, 279(20), 251–272. https://doi.org/10.1016/j.ins.2014.03.118
Web of Science ®Google Scholar
Lai, J. C., & Liaw, Y. C. (1996). Fast searching algorithm for VQ codebook generation. Journal of Visual Communication and Image Representation, 7(2), 163–168. https://doi.org/10.1006/jvci.1996.0016
Web of Science ®Google Scholar
Lee, C. F., Chang, C. C., Shih, C. S., & Agrawal, S. (2019). A survey of data hiding based on vector quantization. Advances in Intelligent Information Hiding and Multimedia Signal Processing, Smart Innovation, Systems and Technologies, 156, 65–72. https://doi.org/10.1007/978-981-13-9714-1_7
Google Scholar
Lin, C. C., Chen, S. C., & Hsueh, N. L. (2009). Adaptive embedding techniques for VQ-compressed images. Information Sciences, 179(1-2), 140–149. https://doi.org/10.1016/j.ins.2008.09.001
Web of Science ®Google Scholar
Lin, Y. C., & Tai, S. C. (1998). A fast Linde-Buzo-Gray algorithm in image vector quantization. IEEE T. Circuits and Systems II: Analog and Digital Signal Processing, 45(3), 432–435. https://doi.org/10.1109/82.664257
Google Scholar
Linde, Y., Buzo, A., & Gray, R. M. (1980). An algorithm for vector quantizer design. IEEE Transactions on Communications, 28(1), 84–95. https://doi.org/10.1109/TCOM.1980.1094577
Web of Science ®Google Scholar
Lu, T. C., & Chang, C. C. (2009). An improved full-search scheme for the vector quantization algorithm based on triangle inequality. International Journal of Innovative Computing, Information and Control, 5(6), 1625–1632.
Web of Science ®Google Scholar
Manohar, K., & Kieu, T. D. (2018). An SMVQ-based reversible data hiding technique exploiting side match distortion. Multimedia Tools and Applications, 77(10), 11727–11750. https://doi.org/10.1007/s11042-017-4814-7
Web of Science ®Google Scholar
Nasrabadi, N. M., & King, R. A. (1988). Image coding using vector quantization: A review. IEEE Transactions on Communications, 36(8), 957–971. https://doi.org/10.1109/26.3776
Web of Science ®Google Scholar
Pan, Z., & Wang, L. (2018). Novel reversible data hiding scheme for two-stage VQ compressed images based on search-order coding. Journal of Visual Communication and Image Representation, 50, 186–198. https://doi.org/10.1016/j.jvcir.2017.11.020
Web of Science ®Google Scholar
Pizzolante, R., Castiglione, A., Carpentieri, B., Santis, A. D., & Palmieri, F., & Castiglione, A. (2018). A. On the protection of consumer genomic data in the Internet of living things. Computers & Security, 74, 384–400. https://doi.org/10.1016/j.cose.2017.06.003
Web of Science ®Google Scholar
Qin, C., & Hu, Y. C. (2016). Reversible data hiding in VQ index table with lossless coding and adaptive switching mechanism. Signal Processing, 129, 48–55. https://doi.org/10.1016/j.sigpro.2016.05.032
Web of Science ®Google Scholar
Rahmani, P., & Dastghaibyfard, G. (2018a). Two reversible data hiding schemes for VQ-compressed images based on index coding. IET Image Processing, 12(7), 1195–1203. https://doi.org/10.1049/iet-ipr.2016.0618
Web of Science ®Google Scholar
Rahmani, P., & Dastghaibyfard, G. (2018b). An efficient histogram-based index mapping mechanism for reversible data hiding in VQ-compressed images. Information Sciences, 435, 224–239. https://doi.org/10.1016/j.ins.2017.12.041
Web of Science ®Google Scholar
Rahmani, P., Norouzzadeh, M. S., & Dastghaibyfard, G. (2016). A novel legitimacy preserving data hiding scheme based on LAS compressed code of VQ index tables. Multidimensional Systems and Signal Processing, 27(2), 433–452. https://doi.org/10.1007/s11045-014-0309-0
Web of Science ®Google Scholar
Schafer, B., & Lilian, E. (2017). “I spy, with my little sensor”: fair data handling practices for robots between privacy, copyright and security. Connection Science, 29(3), 200–209. https://doi.org/10.1080/09540091.2017.1318356
Web of Science ®Google Scholar
Wang, Y., Gong, D., Lu, B., Xiang, F., & Liu, F. (2018). Exception handling-based dynamic software watermarking. IEEE Access, 6, 8882–8889. https://doi.org/10.1109/ACCESS.2018.2810058
Web of Science ®Google Scholar
Xiao, T., Han, D., He, J., Li, K. C., & de Mello, R. F. (2021). Multi-Keyword ranked search based on mapping set matching in cloud ciphertext storage system. Connection Science, 33(1), 95–112. https://doi.org/10.1080/09540091.2020.1753175
Web of Science ®Google Scholar
Xu, S. Y., Chang, C. C., & Liu, Y. J. (2021). A novel image compression technology based on vector quantisation and linear regression prediction. Connection Science, 1–18. https://doi.org/10.1080/09540091.2020.1806206
Web of Science ®Google Scholar
Yang, C., Weng, C., Wang, S., & Sun, H. (2008). Adaptive data hiding in edge areas of images with spatial LSB domain systems. IEEE Transactions on Information Forensics and Security, 3(3), 488–497. https://doi.org/10.1109/TIFS.2008.926097
Web of Science ®Google Scholar
Ying, L., Qiqi, L., Jiulun, F., Fuping, W., Jianlong, F., Qingan, Y., Kiang, C. T., & Nam, L. (2021). Tyre pattern image retrieval–current status and challenges. Connection Science, 1–19. https://doi.org/10.1080/09540091.2020.1806207
Google Scholar
Zhang, X., & Wang, S. (2004). Vulnerability of pixel-value differencing steganography to histogram analysis and modification for enhanced security. Pattern Recognition Letters, 3(25), 331–339. https://doi.org/10.1016/j.patrec.2003.10.014
Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

VQ-oriented data hiding based on adjustable error compensation strategy

Abstract

1. Introduction