Diffusion Model for Decoder Encoder

Latent diffusion model provides efficient and high-quality paraphrase

LDP consists of a diffusion modeling for encoded text space of an off-the-shelf pre-trained encoder and decoder, the diffusion process can be intervened by additional controller . Paraphrase ...

IEEE

DVG-Diffusion: Dual-View-Guided Diffusion Model for CT Reconstruction From X-Rays

Abstract: Directly reconstructing 3D CT volume from few-view 2D X-rays using an end-to-end deep learning network is a challenging task, as X-ray images are merely projection views of the 3D CT volume.

Beebom

I Used the Best AI Models for a Month, and Here are the Top 10

Claude Opus 4.5 Coding tasks, long-running agents, software planning, general chatting Limited multimodal capabilities Paid plan starts at $17 per month Gemini 3 Pro Great at multimodal tasks, Deep ...

IEEE

DiffW: Multi-Encoder Based on Conditional Diffusion Model for Robust Image Watermarking

Abstract: The existing deep-learning based robust watermarking model generally applies a discriminator to form generative adversarial network (GAN) for increasing the quality of encoded images, and ...

marktechpost

This AI Paper Proposes a Novel Dual-Branch Encoder-Decoder Architecture for Unsupervised Speech Enhancement (SE)

Most learning-based speech enhancement pipelines depend on paired clean–noisy recordings, which are expensive or impossible to collect at scale in real-world conditions. Unsupervised routes like ...

EurekAlert!

Exploring a novel approach for improving generative AI models

The developed model modified Schrödinger bridge-type diffusion models to add noise to real data through the encoder and reconstructed samples through the decoder. It uses two objective functions, the ...

PNAS

Predicting the unseen: A diffusion-based debiasing framework for transcriptional response prediction at single-cell resolution

Understanding how cells respond to perturbations of genes is central to uncovering the rules of gene regulation. Single-cell RNA sequencing combined with CRISPR-based perturbations has transformed ...

GitHub

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation

Base text-to-image Model: The first stage is a text-to-image model that utilizes two text encoders: a multimodal large language model (MLLM) to improve image-text alignment, and a multi-language, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results