Topic · Image generation
arxiv arXiv cs.LG · 8d ago

NoiseTilt: Noise-Tilted Reverse Kernels for Diffusion Reward Alignment

NoiseTilt introduces NTRK, a reward-guided diffusion sampler that injects reward gradients via the noise term without altering the reverse kernel. By using a whitening operator, NTRK safely biases noise toward high reward, preserving sample quality while maintaining strong guidance. On aesthetic generation, NTRK achieves superior reward performance with 25 NFEs, reducing compute by 20× compared to state-of-the-art baselines.

arxiv arXiv cs.LG · 18h ago

Atomistic Language Models Understand and Generate Materials

Atomistic Language Models (ALMs) unify language and atomistic structures, enabling natural language-driven crystal generation and optimization. ALMs use a continuous bridge to map language embeddings into atomistic diffusion steering space and employ Text-to-Crystal Feynman-Kac for stoichiometric accuracy. The ALM Bench benchmark evaluates text-conditioned material generation and optimization, with code and weights to be released soon.

media r/LocalLLaMA · 2d ago

Boogu-Image-0.1: Open-Source Unified Image Generation and Editing Model Series

Boogu-Image-0.1 is an Apache-2.0 licensed open-source unified image generation and editing model family, including Base, Turbo, and Edit variants. It offers high-quality text-to-image generation, fast generation, image editing, and strong Chinese-English text rendering, with training data scale roughly one order of magnitude smaller than closed-source systems yet achieving competitive performance through improved model understanding and data quality.