Upload an image or select an example to compare how different VAEs reconstruct it. Now includes remote VAEs via Hugging Face's remote decoding feature!
- The image is padded to a square and resized to the selected size (512 or 1024 pixels).
- Each VAE (local or remote) encodes the image into a latent space and decodes it back.
- Outputs include:
  - Difference Maps: Where the reconstruction differs from the original (white = difference > tolerance).
  - Reconstructed Images: The output of each VAE.
  - Sum of Differences: Total number of pixels whose difference exceeds the tolerance (lower is better).
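
The padding and resizing step from the first bullet can be sketched roughly as follows. This is a minimal illustration using Pillow, not the demo's actual code: the function names `pad_to_square` and `preprocess`, the black fill color, the centered placement, and the Lanczos resampling filter are all assumptions.

```python
from PIL import Image


def pad_to_square(image, fill=(0, 0, 0)):
    """Paste the image onto a square canvas (hypothetical helper).

    The fill color and centering are assumptions; the demo may pad differently.
    """
    width, height = image.size
    side = max(width, height)
    canvas = Image.new("RGB", (side, side), fill)
    # Center the original image on the square canvas.
    offset = ((side - width) // 2, (side - height) // 2)
    canvas.paste(image.convert("RGB"), offset)
    return canvas


def preprocess(image, size=512):
    """Pad to a square, then resize to size x size (512 or 1024 in the demo)."""
    square = pad_to_square(image)
    # Lanczos is an assumed choice of resampling filter.
    return square.resize((size, size), Image.LANCZOS)
```

Padding before resizing keeps the aspect ratio of the original content, so no VAE is judged on a stretched image.
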
Adjust the tolerance to change sensitivity: a lower tolerance flags smaller differences.
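
To illustrate how the tolerance affects the difference map and its score, here is a small NumPy sketch. It is written under stated assumptions rather than taken from the demo: images are float arrays in [0, 1] with shape (height, width, channels), the per-pixel difference is the largest absolute difference across channels, and the score is a count of flagged pixels. The name `difference_map` and the default tolerance of 0.1 are hypothetical.

```python
import numpy as np


def difference_map(original, reconstructed, tolerance=0.1):
    """Return a black/white difference map and the number of flagged pixels.

    Assumes both inputs are arrays of shape (H, W, C) with values in [0, 1].
    Taking the maximum over channels is an assumption; a mean would also work.
    """
    diff = np.abs(original.astype(np.float32) - reconstructed.astype(np.float32))
    per_pixel = diff.max(axis=-1)
    # A pixel is flagged when its difference exceeds the tolerance.
    mask = per_pixel > tolerance
    # White (255) where flagged, black (0) elsewhere.
    diff_map = mask.astype(np.uint8) * 255
    return diff_map, int(mask.sum())
```

Calling this with a smaller `tolerance` on the same image pair can only keep or increase the number of white pixels, which is why scores are only comparable between VAEs at the same tolerance setting.
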