Hiding data within NNs • Salika Dave

A paper review summary part of my coursework in IST597: Trustworthy Machine Learning

HiDDeN: Hiding data with deep networks

Jiren Zhu, Russell Kaplan, Justin Johnson, Li Fei-Fei

Summary

The paper introduces HiDDeN, a framework for data hiding using deep neural networks. It explores the use of neural networks to encode and decode hidden information in images, and shows that the encoded information can be robust to noise and image distortions. The framework comprises an encoder, a noise layer, a decoder, and an adversarial discriminator, and is trained to minimize distortion between cover and encoded images, detectability of encoded images, and message distortion. The paper also discusses related work in adversarial examples, steganography, watermarking, and data hiding with neural networks. The network architecture and training process are detailed, and the performance of the framework is evaluated in terms of capacity, secrecy, and robustness. HiDDeN’s end-to-end trainable framework, blind method, and ability to be used for both steganography and watermarking distinguish it from traditional steganography methods and other neural network-based approaches. Additionally, the model is trained with various noise layers to be robust against multiple noises simultaneously and is undetectable by a steganalyzer when the precise weights are unknown, demonstrating its advantage over static classical algorithms.

Results

The experimental setup in the paper involved evaluating the HiDDeN framework on three axes: capacity, secrecy, and robustness. Capacity was primarily measured using bits per pixel (BPP), which represents the number of message bits hidden per pixel of the encoded image. Secrecy was primarily measured using the detection rate, with a modern steganalyzer used to separate cover from stego images. Robustness was measured using bit accuracy, which represents the number of identical bits between the input message and the predicted message. The network architecture of HiDDeN includes an encoder that applies convolutions to the input image to form an intermediate representation. The message input is spatially replicated and concatenated to the encoder’s intermediary representation to ensure that each convolutional filter in the next layer has access to the entire message as it convolves across each spatial location. The encoder then produces the encoded image. The experimental results also show this framework outperforming Digimarc, a closed-source watermarking tool.

Strengths

The model can encode binary messages in images with high capacity and secrecy, indicating its effectiveness in concealing information within images. The framework is trained to minimize distortion between cover and encoded images, detectability of encoded images, and message distortion, showcasing an end-to-end training approach that optimizes multiple aspects of data hiding. The framework allows for flexibility in trading off between capacity, secrecy, and robustness to different types of noise, showcasing its adaptability to various data hiding requirements. The model is undetectable by a steganalyzer when the precise weights are unknown, demonstrating its advantage over static classical algorithms and its effectiveness in evading detection.

Possible directions for future work

While the paper stands strong in its claims, the experimental evaluations may not have included real-world scenarios or large-scale datasets, which could impact the generalizability of the results. While the paper compares HiDDeN with traditional steganography methods and other neural network-based approaches, it may not have extensively compared the framework with the latest state-of-the-art methods in data hiding and watermarking.