Embedding Secret Data in a Vector Quantization Codebook Using a Novel Thresholding Scheme

Lin, Yijie; Liu, Jui-Chuan; Chang, Ching-Chun; Chang, Chin-Chen

doi:10.3390/math12091332

Open AccessArticle

Embedding Secret Data in a Vector Quantization Codebook Using a Novel Thresholding Scheme

¹

Department of Information Engineering and Computer Science, Feng Chia University, Taichung 40724, Taiwan

²

Information and Communication Security Research Center, Feng Chia University, Taichung 40724, Taiwan

^*

Author to whom correspondence should be addressed.

Mathematics 2024, 12(9), 1332; https://0-doi-org.brum.beds.ac.uk/10.3390/math12091332

Submission received: 27 March 2024 / Revised: 16 April 2024 / Accepted: 18 April 2024 / Published: 27 April 2024

(This article belongs to the Special Issue Advances in Mathematical Cryptography and Information Security toward Industry 5.0)

Download

Browse Figures

Versions Notes

Abstract

:

In recent decades, information security has become increasingly valued, including many aspects of privacy protection, copyright protection, and digital forensics. Therefore, many data hiding schemes have been proposed and applied to various carriers such as text, images, audio, and videos. Vector Quantization (VQ) compression is a well-known method for compressing images. In previous research, most methods related to VQ compressed images have focused on hiding information in index tables, while only a few of the latest studies have explored embedding data in codebooks. We propose a data hiding scheme for VQ codebooks. With our approach, a sender XORs most of the pixel values in a codebook and then applies a threshold to control data embedding. The auxiliary information generated during this process is embedded alongside secret data in the index reordering phase. Upon receiving the stego codebook and the reordered index table, the recipient can extract the data and reconstruct the VQ-compressed image using the reverse process. Experimental results demonstrate that our scheme significantly improves embedding capacity compared to the most recent codebook-based methods. Specifically, we observe an improvement rate of 223.66% in a small codebook of size 64 and an improvement rate of 85.19% in a codebook of size 1024.

Keywords:

Vector Quantization; data hiding; codebook; threshold

MSC:

68P27

1. Introduction

With the widespread usage of digital media in daily life, protecting the security and privacy of data has become increasingly important. Artificial intelligence-based watermarking [1], cryptography systems [2], and data hiding [3,4,5,6,7,8] have become effective means of safeguarding confidential data and personal privacy at a time when AI is rapidly evolving. These new techniques are used in many fields, such as privacy protection, copyright protection, and digital forensics.

Vector Quantization (VQ) [9] is a data compression technology. The basic principle of VQ-compressed images is to divide the image into non-overlapping pixel blocks, and then treat each pixel block as a vector. An algorithm, such as the Linde–Buzo–Gray (LBG) algorithm [10] or another improved algorithm [11], is used to train images into a codebook. The image’s pixel blocks are then mapped to the closest codeword in the codebook, and the indices of these codewords are stored as an index table. During decompression, the VQ compressed image can be reconstructed based on the stored codebook and the index table. In recent years, data hiding in the compression domain of VQ-compressed images has been an active area of research. This technique not only achieves image compression but also embeds data into the compression domain to enhance data security. Most of this research employed techniques to hide data within the index tables [12,13,14,15,16], while some recent studies explored methods for hiding data within the codebooks [17,18]. In 2023, Liu et al. [17] proposed a novel approach that focused on data embedding and extraction through sorting and reordering codewords of a VQ codebook. When a codebook size was 64, they achieved a bit rate of 0.2578 bits per pixel (bpp). Building upon this work, in 2024, Chang et al. [18] introduced a method for data embedding by employing pairwise adjustments of pixels within codewords. When a codebook size was 64, their method achieved a higher bit rate of 0.5820 bpp, indicating improved performance in terms of data capacity.

The key carrier of the proposed scheme is a VQ codebook which is not a common medium for data embedding due to its size. Even though its embedding capacity is relatively small compared to that of a cover image, it is very interesting to find ways to improve the capacity. Therefore, we propose a scheme that utilizes a threshold. First, codewords are preprocessed to reduce their pixel values through XOR operations. Then, data embedding is performed by looking up a rule table. Compared with the previous methods, it not only significantly improves the embedding capacity but also allows adapting to different situations by adjusting of the threshold to meet different requirements.

This particular study makes several significant contributions, as follows:

Compared with a more recent compatible method, the embedding capacity is significantly improved. On a codebook of size 64, the embedding bit rate is 1.8838 bpp, which makes the improvement rate go as high as 223.66%. Even for a codebook of size 1024, the improvement rate can be as high as 85.19%.
Our proposed scheme provides an adjustable threshold that can be adjusted to suit various requirements and reflects the flexibility of our approach.
Our proposed scheme can losslessly reconstruct a VQ-compressed image. Achieving a PSNR of +∞ between a VQ-compressed image and the reconstructed VQ-compress image using the original codebook indicates that the two VQ-compressed images are exactly the same.

In Section 2, we introduce some methods related to our proposed scheme. Section 3 describes the details of data embedding and the reverse process of our proposed scheme. Experimental results demonstrate the good performance of our proposed scheme in Section 4. Finally, in Section 5, our proposed scheme is concluded.

2. Related Work

In this section, we review some technologies related to our scheme. In Section 2.1, we introduce the training method of the VQ codebook using the LBG algorithm [10]. Section 2.2 introduces the VQ codeword index reordering scheme proposed by Liu et al. [17] in 2023. In Section 2.3, we discuss the extended run-length encoding compression algorithm proposed by Chen and Chang [19] back in 2019.

2.1. Vector Quantization Codebook Training

VQ is a well-known lossy data compression technique. It is also called “block quantization” because media are divided into blocks first. A simple VQ training algorithm is as follows:

(a): Pick a random sample point $P$
(b): Let the distance between $P$ and a VQ centroid $c_{i}$ be $d (P, c_{i})$
(c): Find the VQ centroid $c_{i}$ with the shortest distance
(d): Repeat

Figure 1 demonstrates the process of codebook creation.

Linde–Buzo–Gray (LBG) [10] is an algorithm designed based on k-means clustering in 1980 and is frequently used to train VQ codebooks. The process involves collecting a small set of vectors to represent a large set of vectors through iterations. The large set is known as a training set, and the small set collected is called a codebook. Lossy compressed images can then be reconstructed using the codebook and an index table.

2.2. VQ Codeword Index Reordering

Liu et al. [17], in 2023, used an index reordering technique to embed data in well-trained codebooks. The codebook embedding capacity is highly dependent on the size of a codebook. In order to extract embedded data and recover the original codebook from a stego codebook, the codebook needs to be sorted before manipulating the index orders to hide data.

2.2.1. Codebook Sorting

There are different ways of ordering a codebook. Liu et al. simply used a line, projected the codebook vectors to the line to gain the project values, and obtained a sorted codebook by sorting these projected values. Their experiment results showed that the sorting method was effective for the purpose of retrieving the original sorted codebook from a stego codebook. The sorted codebook is then employed to encode/decode images for the compression and recovery process.

2.2.2. VQ Codebook Data Embedding

The embedding capacitor can be calculated by the size of a sorted codebook

C B

. If the size of a sorted codebook is

{s i z e}_{C B}

, the total embedding capacity

E C

is calculated by Equation (1), where

i

represents the index being processed.

E C = \sum_{i = 1}^{{s i z e}_{C B}} ⌊\log_{2} i⌋

(1)

Figure 2 illustrates a simple example of how a codebook embeds data, assuming there is secret data

S = {1011000010010}

in its binary form and the size of a sorted codebook

C B

is 8. Codewords are single dimension values defined as

{c w}_{0} = \{0\}, {c w}_{1} = \{1\}, \dots, {c w}_{7} = \{7\}

. The stego codebook

S C B

is generated after data embedding. In our example, the first embeddable bit count is

e_{8} = ⌊\log_{2} 8⌋ = 3

. The first three bits of secret data

S

,

{(101)}_{2} = {(5)}_{10}

will be embedded in the first round. Take the converted decimal value five as the index value, the codeword

{c w}_{5}

of the original codebook

C B

to the stego codebook

S C B

as

{s c w}_{0}

. After the embedding, the indices of

C B

are adjusted accordingly. Since there are only 7 codewords left in

C B

, the embeddable bit count is now

e_{7} = ⌊\log_{2} 7⌋ = 2

. Take the value of the fourth bit and the fifth bit and convert them to decimal value

{(10)}_{2} = {(2)}_{10}

. The second codeword

{c w}_{2}

is moved from

C B

to

S C B

as

{s c w}_{1}

. The process continues until all secret bits are embedded or there are no more codewords in the original codebook.

2.2.3. VQ Codebook Data Extraction

To recover the data from the stego codebook, the same sorting algorithm is used to sort the stego codebook and generate a sorted codebook

C W

as a reference for data extraction. By the order of the codewords in the stego codebook, the data embedding order is determined. By matching a codeword from the stego codebook and the sorted reference codebook, the embedded data is the index of the codeword in the sorted reference codebook. Convert the index to binary format and append it to the recovered secret data. Remove the matched codeword from both the stego codebook and the sorted reference codebook for continuous recovery stages. Repeat until there are no more codewords in the stego codebook.

Figure 3 explains the data extraction portion using the stego codebook generated from Figure 2. Sort the stego codebook

S C B

after receiving and obtaining the sorted reference codebook

C B

. Match the first codeword

{s c w}_{0}

of

S C B

in

C B

and obtain the index

i = 5

from

C B

. Convert into binary format

{(5)}_{10} = {(101)}_{2}

and append it to the recovered secret

S = {101}

. Remove the codeword from both codebooks before the next round of extraction. Repeat the process of matching the first codeword

{s c w}_{0}

of

S C B

in

C B

and obtain the index of the matched codeword from

C B

. The secret data are extracted. The extracting process ends when there are no codewords in the stego codebook

S C B

.

2.3. Extended Run-Length Encoding

Run-length encoding (RLE) is an effective lossless data compression technology. The run length represents the length of a sequence of consecutive identical symbols. Compression can be achieved by storing symbols and their respective quantities. Since its inception, there have been numerous improvements in RLE-based techniques. In 2019, Chen and Chang proposed the Extended Run-Length Encoding (ERLE) algorithm [19] which categorizes input into two types, fixed-length codewords and variable-length codewords, to further enhance compression performance.

When a run-length

L_{r} < 4

, the ERLE algorithm defines it as a fixed-length codeword; conversely, when a run-length

L_{r} \geq 4

, it is a variable-length codeword. For fixed-length codewords, the prefix length code is recorded as 0, and then the fixed length

L_{f i x}

bits are directly scanned afterward from the current position. For variable-length codewords, the prefix length code

L_{p r e}

is first calculated using Equation (2). The prefix encoding begins with a 1 in the

(L_{p r e} - 1)

bit and ends with a 0. The length of the length symbol is the same as the prefix code, and the value of the length symbol

{S y m b o l}_{l e n g t h}

is as shown in Equation (3). Finally, one bit is used to represent consecutive identical symbols. The sequence is processed differently based on the codeword type and the sequence compressed by the ERLE algorithm is ultimately obtained.

L_{p r e} = ⌊{l o g}_{2} (L_{r})⌋

(2)

{S y m b o l}_{l e n g t h} = L_{r} - 2^{L_{p r e}}

(3)

The same process applies in the decompression stage. First, the prefix code is observed to determine the codeword type. If the prefix code starts with 0, it indicates it is a fixed-length codeword, and the fixed-length

L_{f i x}

bits need to be scanned afterward. If the prefix code starts with one, it indicates it is a variable-length codeword. The scan needs to continue until it reaches 0, and the length from the starting position to the end position determines the length of the prefix code,

L_{p r e}

. Then, scan the following

L_{p r e}

bits to obtain the binary sequence of the length symbol

{S y m b o l}_{l e n g t h}

, convert it to decimal, and calculate using Equation (4) to obtain the run length,

L_{r}

. Finally, read the following bit to represent the consecutive identical symbols. After decompression is complete, the original sequence can be restored losslessly.

L_{r} = {2^{L_{p r e}} + S y m b o l}_{l e n g t h}

(4)

Figure 4 presents a specific example of

L_{f i x} = 4

. In Figure 4a, input the original sequence first, and consecutively scan identical symbols. When

L_{r} = 2

, bit length 2 is less than 4 and 4 bits are scanned, which results in 0010 with 0 as the prefix code. Continue to scan to identify consecutive identical symbols resulting in

L_{r} = 13

. Since 13 is larger than 4,

L_{p r e}

is calculated as

⌊{l o g}_{2} (13)⌋ = 3

, and

{S y m b o l}_{l e n g t h}

is then computed as

13 - 2^{3} = 5

represented by 3 bits. The last added bit represents the repeated symbol 1. Figure 4b illustrates the decompression phase. Initially, 0 is scanned, indicating a fixed-length codeword, and the following 4 bits are scanned. The scan continues until it reaches a 1, then stops when it reaches a 0, yielding

L_{p r e} = 3

. Subsequently, 3 bits are scanned resulting in

{S y m b o l}_{l e n g t h} = 5

. Thus,

L_{r}

is calculated as

2^{3} + 5 = 13

. Finally, 1 bit is scanned and a symbol 1 is obtained, indicating that the original sequence comprises 13 consecutive 1s.

3. Proposed Scheme

In this section, we introduce a novel data hiding scheme in VQ codebooks. The preprocessing phase, which reduces the pixel values of codewords through XOR operations, is described in Section 3.1. Section 3.2 details the data embedding phase, in which we utilize an adjustable threshold to flexibly accommodate different requirements. Section 3.3 outlines the data extraction and image recovery phase after a receiver receives the stego codebook and the reordered index table. Figure 5 shows the flow of our proposed scheme.

3.1. Preprocessing Phase

The sender begins a codebook training process using the original images by employing the LBG algorithm [10]. This process entails partitioning the original images into

4 \times 4

blocks and iteratively refining a codebook to accurately represent these blocks of data. The VQ codebook

C B

and the corresponding index tables

I T

for VQ compressed images are generated after the process. Since the VQ codebook exhibits a characteristic that most pixel values in a codeword are very similar, we utilize XOR to condense these values to assist in achieving better embedding results. In Equation (5), where

i

represents the current codeword and

j

represents the location of pixel

p

in the current codeword, we preserve the first pixel of the codeword

i

as the reference pixel and leave its value unchanged. Meanwhile, we XOR the remaining pixels with the first one to derive the preprocessed pixel values, denoted as

p p

. Figure 6 presents an example of four codewords in a codebook. It fulfills our requirement that the values of other pixels mostly become very small after XORing with the first pixel.

{p p}_{i j} = \{\begin{matrix} p_{i j}, \\ p_{i j} \oplus p_{i 1}, \end{matrix} \begin{matrix} i f j = 1 \\ o t h e r w i s e \end{matrix}

(5)

3.2. Data Embedding Phase

Our scheme supports adjustable embedding rules by setting different thresholds to flexibly adapt to various application scenarios. We utilize encoding rules similar to prefix codes to establish correspondence between the data and the label. Figure 7 illustrates encoding rules for different thresholds, where

t

represents the threshold,

l

represents the label, and

d

represents the data.

To ensure successful recovery of the codeword, the first pixel within each codeword, acting as a reference, must remain unchanged. Before utilizing the embedding process

{s p}_{i j} = p p_{i j} \times t + l

to embed data, it is essential to first ascertain whether the embedded pixel value,

{s p}_{i j}

, will cause an overflow. If an overflow is anticipated, the embedding is deemed unacceptable, and the pixel value remains unchanged. Otherwise, embedding can proceed. Equation (6) encapsulates this mathematical process, where

s p

represents the stego pixel,

p p

represents the preprocessed pixel,

i

and

j

represent the positions of the pixel in the codebook, and

t

is the threshold specified.

{s p}_{i j} = \{\begin{matrix} p p_{i j}, \\ p p_{i j} \times t + l, \end{matrix} \begin{matrix} i f j = 1 o r (p p_{i j} \times t + l) > 255 \\ o t h e r w i s e \end{matrix}

(6)

While calculating

s p

, we need to check whether the output

s p

conforms to the input rules of

p p

to ensure correct recovery. In terms of mathematical formulas, it is necessary to check the

s p

of all non-reference pixels (

j \neq 1

). If

{s p}_{i j} \times t \in [0,255]

, it means that it is naturally known that it can be embedded without additional auxiliary information. If

{s p}_{i j} \times t > 255

, further checking shown in Equation (7) is required. If it is not embeddable, let the indicator

{i n d}_{k} = 0

be saved as auxiliary information; otherwise, if it is embeddable, let the indicator

{i n d}_{k} = 1

, where

k

is the current indicator, and

k

is automatically incremented every time an indicator is added. Finally, we obtain a binary sequence composed of indicators. Because most preprocessed pixel values in the codebook are relatively close, there will be many consecutive 0’s or consecutive 1’s in the binary sequence. We can utilize the ERLE algorithm [19] for auxiliary information compression to conserve capacity and allocate more capacity for embedded data.

{i n d}_{k} = \{\begin{matrix} 0, \\ 1, \end{matrix} \begin{matrix} i f (p p_{i j} \times t + l) > 255 \\ o t h e r w i s e \end{matrix}

(7)

Figure 8 provides a codeword example that covers all embedding cases when

t = 3

. In this codeword, the first pixel (

j = 1

) circled in yellow serves as the reference pixel which remains unchanged. The pixel circled in orange is a case of non-embeddable because its pixel value would overflow after embedding. To avoid confusion during the data extraction and image recovery phases for the non-embeddable case, auxiliary information

{i n d}_{k} = 0

is required to be recorded. Circles in light green and dark green indicate pixels are embeddable, and Equation (6) is the mathematical formula for embedding. Taking the row in light green, when

t = 3

and the data

d

embedded is 00, the corresponding label

l = 0

according to Figure 7; therefore, the embedded pixel value

s p = 60 \times 3 + 0 = 180

. Similarly, in the dark green row, the corresponding label

l = 1

when the data

d

embedded is 01, so

s p = 8 \times 3 + 1 = 25

. The light green row indicates that it needs to check whether the output obeys the input rules,

{s p}_{i j} \times t > 255

. To prevent confusion with non-embeddable situations, it is necessary to use

{i n d}_{k} = 1

as the auxiliary information. Finally, the dark green row indicates that the embedded pixel can be known directly without any auxiliary information.

After our scheme completes embedding, the stego codebook is sorted using Liu et al.’s method [17] and then more data can be embedded using VQ reordering. It should be noted that there are very important technical details in the process. During embedding, we first retain the reference pixels and record indicators for non-embeddable pixels. Then, we embed the embeddable pixel, represented by light green in Figure 8, which requires an indicator. At this point, a portion of the secret data has been embedded and all indicators have been obtained. After applying ERLE algorithm compression, we embed the auxiliary information as data into pixels that can be directly known to be embeddable, represented by dark green in Figure 8. If the capacity does not support embedding all auxiliary information, the remaining auxiliary information can also be embedded using VQ reordering. If there is remaining capacity, additional secret data can be embedded. Eventually, the sender will transmit the stego codebook and the reordered index table to the receiver.

Algorithm 1 describes the process of our proposed scheme in the data embedding phase, outlining how data is encoded and embedded into the stego data.

Algorithm 1 Data Embedding
Input	Preprocessed codebook $P C B$ , index table $I T$ , secret $S$ , and threshold $t$ .
Output	Stego codebook $S C B$ and reordered index table $R I T$ .
Step 1	Preserve reference pixels and pixels causing overflow. Hide secret data in pixels flagged by indicators according to encoding rules. Record all indicators. $k = 1$ ; for $i \in {1, 2, \dots, {s i z e}_{P C B}}$ for $j \in {1, 2, \dots, 16}$ Obtain the label $l$ according to the encoding rules. if $j = 1$ or $({p p}_{i j} \times t + l) > 255$ ${s p}_{i j} = {p p}_{i j}$ ; else if ${s p}_{i j} \times t > 255$ ${s p}_{i j} = {p p}_{i j} \times t + l$ ; if $({p p}_{i j} \times t + l) > 255$ indicator ${i n d}_{k} = 0$ ; else indicator ${i n d}_{k} = 1$ ; end if $k = k + 1$ ; end if end if end for end for
Step 2	Compress the indicator sequence using the ERLE algorithm to obtain compressed auxiliary information.
Step 3	Similar to Step 1, but here, $l$ is first used for auxiliary information. Any remaining space can be used to embed the secret. for $i \in {1, 2, \dots, {s i z e}_{P C B}}$ for $j \in {1, 2, \dots, 16}$ Obtain the label $l$ according to the encoding rules. if $j = 1$ or $({p p}_{i j} \times t + l) > 255$ ${s p}_{i j} = {p p}_{i j}$ ; else if ${s p}_{i j} \times t \leq 255$ ${s p}_{i j} = {p p}_{i j} \times t + l$ ; end if end if end for end for
Step 4	Utilize reordering for additional data embedding. Embed any remaining auxiliary information not fully embedded in Step 3, and use available spaces to embed secret data.
Step 5	Output stego codebook $S C B$ and reordered index table $R I T$ .

3.3. Data Extraction and Image Recovery Phase

When the receiver has a stego codebook and a reordered index table, the stego codebook must first be reordered to extract data, which may include both secret data and auxiliary information. Then, it proceeds to extract data from pixel values that can be directly identified as embedded, which may also include both secret data and auxiliary information. After extracting all auxiliary information, ERLE algorithm decompression is applied to retrieve all indicators. Subsequently, all pixels other than references are examined with

s p_{i j} \times t > 255

. If

{i n d}_{k} = 0

, the pixel remains unchanged. If

{i n d}_{k} = 1

, data extraction occurs. The mathematical formulas for data extraction are depicted in Equations (8) and (9). It is evident that when pixels are references or non-embedded ones, they remain unchanged. Otherwise, they are embedded pixels, and extraction operations are executed, denoted as

{r p}_{i j} = ⌊s p_{i j} \div t⌋

, where

r p p

represents the pixel of the reordered preprocessed codebook. Subsequently, the label

l = {s p_{i j} - r p}_{i j} \times t

can be calculated, and then the data can be extracted by referring to the encoding rules depicted in Figure 7.

{r p p}_{i j} = \{\begin{matrix} s p_{i j}, \\ ⌊s p_{i j} \div t⌋, \end{matrix} \begin{matrix} i f j = 1 o r (s p_{i j} \times t > 255 a n d {i n d}_{k} = 0) \\ o t h e r w i s e \end{matrix}

(8)

l = {s p_{i j} - r p p}_{i j} \times t

(9)

Figure 9 provides an example of the data extraction phase in a codeword when

t = 3

. Circled in yellow are the reference pixels, which remain unchanged. Subsequently, the other pixels are checked. If

s p_{i j} \times t > 255

, then the indicator is further examined. If

{i n d}_{k} = 0

, it indicates non-embeddable pixels, which remain unchanged and are represented in orange in the figure. If

{i n d}_{k} = 1

, it indicates the presence of embedding. For the pixels circled in light green in the picture, first, calculate

{r p p}_{i j} = ⌊180 \div 3⌋ =

60, then compute the label

l = 180 - 60 \times 3 = 0

, and subsequently refer to the encoding rules to extract the data as 00. Similarly, the same applies to pixels directly identified as embedded, represented in dark green. First, calculate

{r p p}_{i j} = ⌊25 \div 3⌋ =

8, then compute the label

l = 25 - 8 \times 3 = 1

. According to the encoding rules, the extracted data is 01.

After the data is extracted, we obtain the reordered preprocessed codebook. Subsequently, we only need to perform post-processing to restore the pixels. As shown in Equation (10), the reference pixel remains unchanged, while the remaining pixels and the first pixel are XORed to obtain the recovered pixel,

r p

. After processing all pixels, the receiver can obtain the recovered codebook, and then compare the reordered index table with it to recover the VQ compressed image. After processing all pixels, the receiver can obtain the recovered codebook. Then, the receiver compares the reordered index table with it to reconstruct the VQ compressed image.

{r p}_{i j} = \{\begin{matrix} {r p p}_{i j}, \\ {r p p}_{i j} \oplus {r p p}_{i 1}, \end{matrix} \begin{matrix} i f j = 1 \\ o t h e r w i s e \end{matrix}

(10)

Algorithm 2 describes the process of our proposed scheme in the data extraction phase, outlining how data is decoded and extracted from the stego data, and then recovers the codebook.

Algorithm 2 Data Extraction
Input	Stego codebook $S C B$ , reordered index table $R I T$ , and threshold $t$ .
Output	Secret $S$ and reordered codebook $R C B$ .
Step 1	Use reordering of the stego codebook $S C B$ to extract data.
Step 2	Keep reference pixels unchanged and recover the pixels that can be directly known to be embedded and extract data. for $i \in {1, 2, \dots, {s i z e}_{S C B}}$ for $j \in {1, 2, \dots, 16}$ if $j = 1$ ${r p p}_{i j} = {s p}_{i j}$ ; else if ${s p}_{i j} \times t \leq 255$ ${r p p}_{i j} = ⌊s p_{i j} \div t⌋$ ; $l = {s p_{i j} - r p p}_{i j} \times t$ ; Obtain the data according to the encoding rules. end if end if end for end for
Step 3	Similar to Step 2, the situation of $s p_{i j} \times t > 255$ needs to be judged based on the indicator. for $i \in {1, 2, \dots, {s i z e}_{S C B}}$ for $j \in {1, 2, \dots, 16}$ if $j = 1$ ${r p p}_{i j} = {s p}_{i j}$ ; else if ${s p}_{i j} \times t > 255$ if ${i n d}_{k} = 0$ ${r p p}_{i j} = {s p}_{i j}$ ; else ${r p p}_{i j} = ⌊s p_{i j} \div t⌋$ ; $l = {s p_{i j} - r p p}_{i j} \times t$ ; Obtain the data according to the encoding rules. end if Obtain the secret $S$ and the reordered preprocessed codebook $R P C B$ . end if end if end for end for
Step 4	Post-processing for reordered preprocessed codebook $R P C B$ for $i \in {1, 2, \dots, {s i z e}_{S C B}}$ for $j \in {1, 2, \dots, 16}$ if $j = 1$ ${r p}_{i j} = {r p p}_{i j}$ ; else ${r p}_{i j} = {r p p}_{i j} \oplus {r p p}_{i 1}$ ; end if end for end for Obtain the recovered codebook $R C B$ .
Step 5	Output secret $S$ and recovered codebook $R C B$ .

4. Experimental Results

This section shows the experimental results of our proposed scheme. Figure 10 displays six grayscale images of size 512 × 512 used as test images, trained to generate a VQ codebook. Our experimental environment runs on a Windows 11 operating system with an AMD Ryzen 7 5800H CPU with Radeon Graphics, 16 GB of memory, and MATLAB R2023B software.

Table 1 presents comparisons of PSNR values between a reconstructed image and its original image (OI), as well as between the reconstructed image and its VQ image (VI) reconstructed using the original codebook. It can be clearly observed from the table that the PSNR values for all VI images are +∞, which shows that our proposed scheme can reconstruct VQ-compressed images fully.

Our scheme supports adjustable thresholds. Table 2 shows embedded capacities (ECs) for codebook sizes 64, 128, 256, 512, and 1024, with different thresholds from 2 to 8. It is worth mentioning that with codebook sizes 512 and 1024, we observe that the ECs of

t = 5

decreases compared with the ECs of

t = 4

. The main reason is that when

t

is larger, the number of embeddable pixels decreases even the embedded bits for a pixel may increase. The combination of these factors leads to the changes in the ECs. We also observe that when

t = 8

, the embedding capacity is the largest regardless of the codebook size and this result is in line with the methodology of our proposed scheme. We can see the reason from Figure 7, as when

t = 8

, every label can embed 3 bits, so a high embedding capacity can be achieved. With a codebook size of 1024, the embedding capacity can be as high as 27,425 bits.

Table 3 shows the comparison between our scheme with

t = 8

and those of Liu et al. [17] and Chang et al. [18]. We compared the embedding capacity (EC) and embedding rate (ER) for codebooks of sizes 64, 128, 256, 512, and 1024. The ER is expressed in bits per pixel (bpp), calculated as the total embedded bits divided by the total number of pixels in the codebook, as shown in Equation (11). It can be intuitively illustrated from the table that the embedding capacity of our scheme is far better than the other two schemes for codebooks of any size. For a 64-size codebook, the embedding rate is as high as 1.8838 bpp. Compared with the latest scheme by Chang et al., the improvement rate reaches as high as 223.66%. Even for a codebook of size 1024, the improvement rate is as high as 85.19% which fully reflects the superior performance of our scheme.

E R = \frac{E C}{t o t a l n u m b e r o f p i x e l s}

(11)

We randomly sampled 12 images from the USC-SIPI image database [20], comprising images of various sizes (256 × 256, 512 × 512, and 1024 × 1024). These images were utilized to train codebooks sized 64, 128, 256, 512, and 1024. Figure 11 illustrates the comparisons of ECs with other codebook-based data embedding schemes.

5. Conclusions

Our proposed scheme introduces a novel approach for embedding data in VQ codebooks. We achieve this by XORing most pixel values in the codebook and applying a threshold to regulate data embedding. Through this method, we efficiently integrate both secret data and auxiliary information during the index reordering step of embedding. Upon receiving the stego codebook and the reordered index table, the recipient can losslessly extract the data and reconstruct the VQ-compressed image using the reverse process. Experimental validation of our scheme demonstrates a substantial enhancement in embedding capacity compared to state-of-the-art codebook-based methods. Specifically, we achieve an embedding rate of 1.8838 bpp with an impressive improvement rate of 223.66% using small codebooks of size 64. Similarly, we observe a notable improvement rate of 85.19% in larger codebooks of size 1024. These findings highlight the superior performance and efficacy of our proposed scheme when using VQ codebooks to embed data. Future research directions may encompass more study of carrier characteristics, achieving higher embedding capacity, and expanding towards verifiability. These endeavors hold significant implications for privacy protection, copyright preservation, and digital forensics.

Author Contributions

Conceptualization, Y.L., J.-C.L. and C.-C.C. (Chin-Chen Chang); methodology, Y.L., J.-C.L. and C.-C.C. (Chin-Chen Chang); software, Y.L.; validation, Y.L.; writing—original draft preparation, Y.L. and J.-C.L.; writing—review and editing, Y.L., J.-C.L., C.-C.C. (Chin-Chen Chang) and C.-C.C. (Ching-Chun Chang). All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Sahu, A.K.; Umachandran, K.; Biradar, V.D.; Comfort, O.; Sri Vigna Hema, V.; Odimegwu, F.; Saifullah, M.A. A study on content tampering in multimedia watermarking. SN Comput. Sci. 2023, 4, 222. [Google Scholar] [CrossRef]
Ramesh, R.K.; Dodmane, R.; Shetty, S.; Aithal, G.; Sahu, M.; Sahu, A.K. A Novel and Secure Fake-Modulus Based Rabin-3 Cryptosystem. Cryptography 2023, 7, 44. [Google Scholar] [CrossRef]
Ni, Z.; Shi, Y.Q.; Ansari, N.; Su, W. Reversible data hiding. IEEE Trans. Circuits Syst. Video Technol. 2006, 16, 354–362. [Google Scholar]
Chang, C.C.; Kieu, T.D.; Chou, Y.C. A high payload steganographic scheme based on (7, 4) hamming code for digital images. In Proceedings of the 2008 International Symposium on Electronic Commerce and Security, Guangzhou, China, 3–5 August 2008; IEEE: Piscataway, NJ, USA, 2008; pp. 16–21. [Google Scholar]
Zhang, X. Reversible data hiding in encrypted image. IEEE Signal Process. Lett. 2011, 18, 255–258. [Google Scholar] [CrossRef]
Huang, C.T.; Weng, C.Y.; Shongwe, N.S. Capacity-Raising Reversible Data Hiding Using Empirical Plus–Minus One in Dual Images. Mathematics 2023, 11, 1764. [Google Scholar] [CrossRef]
Zhang, Q.; Chen, K. Reversible Data Hiding in Encrypted Images Based on Two-Round Image Interpolation. Mathematics 2023, 12, 32. [Google Scholar] [CrossRef]
He, D.; Cai, Z. Reversible Data Hiding for Color Images Using Channel Reference Mapping and Adaptive Pixel Prediction. Mathematics 2024, 12, 517. [Google Scholar] [CrossRef]
Gray, R. Vector quantization. IEEE Assp Mag. 1984, 1, 4–29. [Google Scholar] [CrossRef]
Linde, Y.; Buzo, A.; Gray, R. An algorithm for vector quantizer design. IEEE Trans. Commun. 1980, 28, 84–95. [Google Scholar] [CrossRef]
Chang, C.C.; Hu, Y.C. A fast LBG codebook training algorithm for vector quantization. IEEE Trans. Consum. Electron. 1998, 44, 1201–1208. [Google Scholar] [CrossRef]
Hsieh, C.H.; Tsai, J.C. Lossless compression of VQ index with search-order coding. IEEE Trans. Image Process. 1996, 5, 1579–1582. [Google Scholar] [CrossRef] [PubMed]
Yang, C.H.; Lin, Y.C. Reversible data hiding of a VQ index table based on referred counts. J. Vis. Commun. Image Represent. 2009, 20, 399–407. [Google Scholar] [CrossRef]
Lee, J.D.; Chiou, Y.H.; Guo, J.M. Lossless data hiding for VQ indices based on neighboring correlation. Inf. Sci. 2013, 221, 419–438. [Google Scholar] [CrossRef]
Qin, C.; Hu, Y.C. Reversible data hiding in VQ index table with lossless coding and adaptive switching mechanism. Signal Process. 2016, 129, 48–55. [Google Scholar] [CrossRef]
Rahmani, P.; Dastghaibyfard, G. Two reversible data hiding schemes for VQ-compressed images based on index coding. IET Image Process. 2018, 12, 1195–1203. [Google Scholar] [CrossRef]
Liu, J.C.; Chang, C.C.; Lin, C.C. Hiding Information in a Well-Trained Vector Quantization Codebook. In Proceedings of the 2023 6th International Conference on Signal Processing and Machine Learning, Tianjin, China, 14–16 July 2023; pp. 287–292. [Google Scholar]
Chang, C.C.; Liu, J.C.; Chang, C.C.; Lin, Y. Hiding Information in a Reordered Codebook Using Pairwise Adjustments in Codewords. In Proceedings of the 2024 5th International Conference on Computer Vision and Computational Intelligence, Bangkok, Thailand, 29–31 January 2024. [Google Scholar]
Chen, K.; Chang, C.C. High-capacity reversible data hiding in encrypted images based on extended run-length coding and block-based MSB plane rearrangement. J. Vis. Commun. Image Represent. 2019, 58, 334–344. [Google Scholar] [CrossRef]
Weber, A.G. The USC-SIPI Image Database: Version 5. 2006. Available online: http://sipi.usc.edu/database/ (accessed on 11 April 2024).

Figure 1. Process of training a codebook.

Figure 2. Embedding using VQ codeword index reordering.

Figure 3. Data extraction using VQ codeword index reordering.

Figure 4. An example of ERLE algorithm. (a) Compression Phase. (b) Decomposition Phase.

Figure 5. Flow of Our Proposed Scheme.

Figure 6. An example of preprocessing phase in a segment of the codebook.

Figure 7. Encoding Rules for Different Thresholds.

Figure 8. An example of data embedding phase in a codeword (

t = 3

).

Figure 8. An example of data embedding phase in a codeword (

t = 3

).

Figure 9. An example of data extraction phase in a codeword (

t = 3

).

Figure 9. An example of data extraction phase in a codeword (

t = 3

).

Figure 10. Six 512 × 512 test images: (a) Egretta, (b) Elaine, (c) peppers, (d) Tiffany, (e) woodland, (f) Zelda.

Figure 11. Comparisons with other data embedding schemes [17,18] in the codebook trained by 12 random images from the USC-SIPI image database.

Table 1. PSNR comparisons of reconstructed images with their original images and reconstructed images with the VQ images reconstructed using the original codebook.

Codebook Size	64		128		256		512		1024
Image	OI	VI	OI	VI	OI	VI	OI	VI	OI	VI
Egretta	29.6811	+∞	30.4843	+∞	31.1983	+∞	31.9952	+∞	32.7203	+∞
Elaine	29.3081	+∞	30.1569	+∞	30.8513	+∞	31.4101	+∞	32.1139	+∞
Peppers	28.2237	+∞	29.1556	+∞	30.4569	+∞	31.3893	+∞	32.3213	+∞
Tiffany	27.0808	+∞	27.7653	+∞	28.4232	+∞	29.6245	+∞	30.4309	+∞
Woodland	31.1364	+∞	32.2857	+∞	33.0819	+∞	33.8991	+∞	34.6139	+∞
Zelda	31.8216	+∞	32.8539	+∞	33.6891	+∞	34.5065	+∞	35.2775	+∞

Table 2. Embedding capacity comparison under different codebook sizes and different thresholds.

Codebook Size		64	128	256	512	1024
$Threshold t$	2	1098	2206	4727	9888	20,478
	3	1449	2819	5833	12,022	24,413
	4	1581	3169	6470	13,312	26,846
	5	1745	3336	6560	12,889	26,185
	6	1834	3545	6714	13,119	26,374
	7	1885	3655	6943	13,456	26,809
	8	1929	3738	7114	13,821	27,425

Table 3. Comparison with other data embedding schemes in the codebook.

Codebook Size		64	128	256	512	1024
EC	Liu et al. [17]	264	649	1546	3595	8204
	Chang et al. [18]	596	1433	3128	6876	14,809
	Proposed Scheme	1929	3738	7114	13,821	27,425
ER	Liu et al. [17]	0.2578	0.3169	0.3774	0.4388	0.5007
	Chang et al. [18]	0.5820	0.6997	0.7637	0.8394	0.9039
	Proposed Scheme	1.8838	1.8252	1.7368	1.6871	1.6739
Improvement Rate		223.66%	160.85%	127.43%	101.00%	85.19%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lin, Y.; Liu, J.-C.; Chang, C.-C.; Chang, C.-C. Embedding Secret Data in a Vector Quantization Codebook Using a Novel Thresholding Scheme. Mathematics 2024, 12, 1332. https://0-doi-org.brum.beds.ac.uk/10.3390/math12091332

AMA Style

Lin Y, Liu J-C, Chang C-C, Chang C-C. Embedding Secret Data in a Vector Quantization Codebook Using a Novel Thresholding Scheme. Mathematics. 2024; 12(9):1332. https://0-doi-org.brum.beds.ac.uk/10.3390/math12091332

Chicago/Turabian Style

Lin, Yijie, Jui-Chuan Liu, Ching-Chun Chang, and Chin-Chen Chang. 2024. "Embedding Secret Data in a Vector Quantization Codebook Using a Novel Thresholding Scheme" Mathematics 12, no. 9: 1332. https://0-doi-org.brum.beds.ac.uk/10.3390/math12091332

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Embedding Secret Data in a Vector Quantization Codebook Using a Novel Thresholding Scheme

Abstract

1. Introduction

2. Related Work

2.1. Vector Quantization Codebook Training

2.2. VQ Codeword Index Reordering

2.2.1. Codebook Sorting

2.2.2. VQ Codebook Data Embedding

2.2.3. VQ Codebook Data Extraction

2.3. Extended Run-Length Encoding

3. Proposed Scheme

3.1. Preprocessing Phase

3.2. Data Embedding Phase

3.3. Data Extraction and Image Recovery Phase

4. Experimental Results

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI