- controllable video generation
- coding test
- Programmers
- paper review
- video editing
- emerdiff
- DP
- style align
- Naver Boostcamp AI Tech 6th cohort
- score distillation
- coding test
- BOJ
- ViT
- VirtualTryON
- image editing
- diffusion
- vision transformer
- transformer
- diffusion models
- Programmers
- magdiff
- Python
- controlnext
- diffusion model
- segmentation map generation
- segmentation map
- 3d editing
- 3d generation
- video generation
- dreammotion
평범한 필기장 — All Posts (90)
Paper : https://arxiv.org/abs/2401.11739 — EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models
0. Abstract Diffusion m..
Paper : https://arxiv.org/abs/2303.04761 — Video-P2P: Video Editing with Cross-attention Control
Github : https://github.com/dvlab-resea..
0. Abstract — Key challenge: limiting the error that arises from the randomness and inaccuracy at each step of a naive DDIM inversion process; this error can cause temporal inconsistency in video editing.
1. Introduction — This paper aims to build a zero-shot video editing method using a diffusion model. The inversion process helps the video editing result by providing a sequence of temporally coherent initial latents. However, as the image below shows, a direct inversion process pot..
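Since this preview centers on DDIM inversion, here is a minimal sketch of the standard deterministic DDIM inversion update it refers to; the names (`unet`, `alphas_cumprod`, `cond`, the torch-tensor schedule) are assumptions for illustration, not code from this post or the paper.

```python
import torch

@torch.no_grad()
def ddim_invert_step(unet, x_t, t, t_next, alphas_cumprod, cond):
    # One deterministic DDIM inversion step: map x_t to the noisier x_{t_next}.
    a_t, a_next = alphas_cumprod[t], alphas_cumprod[t_next]
    eps = unet(x_t, t, cond)                                    # predicted noise eps_theta(x_t, t)
    x0_pred = (x_t - (1 - a_t).sqrt() * eps) / a_t.sqrt()       # predicted clean latent
    return a_next.sqrt() * x0_pred + (1 - a_next).sqrt() * eps  # re-noise toward t_next

@torch.no_grad()
def ddim_inversion(unet, x0, timesteps, alphas_cumprod, cond):
    # Invert along an increasing timestep schedule, keeping the latent at every step.
    # For video this is usually applied per frame, which is where per-step inaccuracy
    # can accumulate differently in each frame and break temporal consistency.
    latents, x = [x0], x0
    for t, t_next in zip(timesteps[:-1], timesteps[1:]):
        x = ddim_invert_step(unet, x, t, t_next, alphas_cumprod, cond)
        latents.append(x)
    return latents
```

Running this per frame yields the "temporally coherent initial latents" the post mentions; the per-step error it also mentions is exactly what accumulates independently across frames.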
Paper : https://arxiv.org/abs/2403.12002 — DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing
Project P..
Paper : https://arxiv.org/abs/2408.06070 — ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Github : https://g..
Paper : https://arxiv.org/abs/2403.07420 — DragAnything: Motion Control for Anything using Entity Representation
Project Page : https://..
Paper : https://arxiv.org/abs/2311.17338 — MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
1. Introduc..
Paper : https://arxiv.org/abs/2403.14617v3 — Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
0. ..
Project Page : https://style-aligned-gen.github.io/ — Style Aligned Image Generation via Shared Attention (CVPR 2024, Oral)
Paper : https://arxiv.org/abs/2312.02133
Paper, Project Page, Github : inbarhub/DDPM_inversion — official PyTorch implementation of "An Edit Friendly DDPM Noise Space: Inversion and Manipulations", CVPR 2024
Problem to solve — In this paper, instead of the existing DDIM latent, the DDPM la..
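The preview cuts off at the DDPM-latent idea, so here is a hedged sketch of what an "edit-friendly" DDPM noise-space extraction could look like, based only on the title visible above: sample an auxiliary trajectory x_1..x_T independently from q(x_t | x_0), then solve for the per-step noise that makes the DDPM sampler map x_t back to x_{t-1}. All names (`unet`, `betas`, `cond`) are assumptions, not the official implementation.

```python
import torch

@torch.no_grad()
def edit_friendly_ddpm_latents(unet, x0, betas, cond):
    # betas: 1-D torch tensor of length T; the noise schedule of the pretrained model.
    T = betas.shape[0]
    alphas = 1.0 - betas
    alphas_cumprod = torch.cumprod(alphas, dim=0)

    # 1) Sample x_1..x_T *independently* from q(x_t | x_0),
    #    instead of accumulating noise step by step.
    xs = [x0]
    for t in range(1, T + 1):
        a_bar_t = alphas_cumprod[t - 1]
        xs.append(a_bar_t.sqrt() * x0 + (1 - a_bar_t).sqrt() * torch.randn_like(x0))

    # 2) For each step, extract the noise z_t that makes the DDPM update reproduce x_{t-1} from x_t.
    zs = []
    for t in range(T, 1, -1):  # t = T, ..., 2 (the posterior variance at t = 1 is zero here)
        x_t, x_prev = xs[t], xs[t - 1]
        a_t, a_bar_t, a_bar_prev = alphas[t - 1], alphas_cumprod[t - 1], alphas_cumprod[t - 2]
        eps_pred = unet(x_t, t - 1, cond)                                   # eps_theta(x_t, t)
        x0_pred = (x_t - (1 - a_bar_t).sqrt() * eps_pred) / a_bar_t.sqrt()  # predicted clean latent
        # DDPM posterior mean and std of q(x_{t-1} | x_t, x0_pred)
        mu = (a_bar_prev.sqrt() * betas[t - 1] / (1 - a_bar_t)) * x0_pred \
             + (a_t.sqrt() * (1 - a_bar_prev) / (1 - a_bar_t)) * x_t
        sigma = ((1 - a_bar_prev) / (1 - a_bar_t) * betas[t - 1]).sqrt()
        zs.append((x_prev - mu) / sigma)
    return xs, zs
```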