'AI' 카테고리의 글 목록

« 2025/01 »
일	월	화	수	목	금	토
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

[평범한 학부생이 하는 논문 리뷰] Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code (ICLR 2024)

Paper : https://arxiv.org/abs/2310.01506 Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of CodeText-guided diffusion models have revolutionized image generation and editing, offering exceptional realism and diversity. Specifically, in the context of diffusion-based editing, where a source image is edited according to a target prompt, the process comarxiv.orgProject Page : https:..

AI/Diffusion Models 2025. 1. 8. 17:20

[평범한 학부생이 하는 논문 리뷰] EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models (ICLR 2024)

Paper : https://arxiv.org/abs/2401.11739 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion ModelsDiffusion models have recently received increasing research attention for their remarkable transfer abilities in semantic segmentation tasks. However, generating fine-grained segmentation masks with diffusion models often requires additional training on anarxiv.org0. Abstract Diffusion m..

AI/Diffusion Models 2025. 1. 6. 14:40

[평범한 학부생이 하는 논문 리뷰] Video-P2P: Video Editing with Cross-attention Control (CVPR 2024)

Paper : https://arxiv.org/abs/2303.04761 Video-P2P: Video Editing with Cross-attention ControlThis paper presents Video-P2P, a novel framework for real-world video editing with cross-attention control. While attention control has proven effective for image editing with pre-trained image generation models, there are currently no large-scale video gearxiv.orgGithub : https://github.com/dvlab-resea..

AI/Video 2025. 1. 3. 23:21

[평범한 학부생이 하는 논문 리뷰] WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing (ECCV 2024)

0. AbstractKey Challenge : Naive DDIM inversion process의 각 step에서의 randomness와 inaccuracy에 의해 발생하는 error를 제한하는 것.이는 video editing에서 temporal inconsistency를 야기할 수 있다.1. Introduction 본 논문은 diffusion model을 이용해서 zero-shot video editing method를 만드는 것을 목표로 한다. Inversion process는 temporally cohorent initial latents의 sequence를 제공함으로써 video editing 결과에 도움을 준다. 그러나 아래의 이미지처럼 direct inversion process는 pot..

AI/Video 2024. 12. 19. 00:06

[평범한 학부생이 하는 논문 리뷰] DreamMotion : Space-Time Self-Similar Score Distillation for Zero-shot Video Editing (ECCV 2024)

Paper : https://arxiv.org/abs/2403.12002 DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video EditingText-driven diffusion-based video editing presents a unique challenge not encountered in image editing literature: establishing real-world motion. Unlike existing video editing approaches, here we focus on score distillation sampling to circumvent the stanarxiv.orgProject P..

AI/Video 2024. 12. 14. 23:35

[평범한 학부생이 하는 논문 리뷰] ControlNeXt: Powerful and Efficient Control for Image and Video Generation (arXiv 2408)

Paper : https://arxiv.org/abs/2408.06070 ControlNeXt: Powerful and Efficient Control for Image and Video GenerationDiffusion models have demonstrated remarkable and robust abilities in both image and video generation. To achieve greater control over generated results, researchers introduce additional architectures, such as ControlNet, Adapters and ReferenceNet, to intearxiv.orgGithub : https://g..

AI/Diffusion Models 2024. 11. 29. 17:02

[평범한 학부생이 하는 논문 리뷰] DragAnything : Motion Control for Anything using Entity Representation (ECCV 2024)

Paper : https://arxiv.org/abs/2403.07420 DragAnything: Motion Control for Anything using Entity RepresentationWe introduce DragAnything, which utilizes a entity representation to achieve motion control for any object in controllable video generation. Comparison to existing motion control methods, DragAnything offers several advantages. Firstly, trajectory-based isarxiv.orgProject Page : https://..

AI/Video 2024. 11. 12. 14:43

[평범한 학부생이 하는 논문 리뷰] MagDiff : Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing (ECCV 2024)

Paper : https://arxiv.org/abs/2311.17338 MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and EditingThe diffusion model is widely leveraged for either video generation or video editing. As each field has its task-specific problems, it is difficult to merely develop a single diffusion for completing both tasks simultaneously. Video diffusion sorely relyinarxiv.org1. Introduc..

AI/Video 2024. 10. 31. 02:00

[평범한 학부생이 하는 논문 리뷰] VIDEOSHOP : Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion (ECCV 2024)

Paper : https://arxiv.org/abs/2403.14617v3 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion InversionWe introduce Videoshop, a training-free video editing algorithm for localized semantic edits. Videoshop allows users to use any editing software, including Photoshop and generative inpainting, to modify the first frame; it automatically propagates those charxiv.org0. ..

AI/Video 2024. 10. 29. 22:48

[평범한 학부생이 하는 논문 리뷰] Style Aligned Image Generation via Shared Attention (CVPR 2024)

Project Page : https://style-aligned-gen.github.io/ StyleAlignStyle Aligned Image Generation via Shared Attention CVPR 2024, Oral Amir Hertz* 1 Andrey Voynov* 1 Shlomi Fruchter† 1 Daniel Cohen-Or† 1,2 1 Google Research 2 Tel Aviv University *Indicates Equal Contribution †Indicates Equal Advising [Paper] style-aligned-gen.github.ioPaper : https://arxiv.org/abs/2312.02133 Style Aligned Image Ge..

AI/Diffusion Models 2024. 10. 16. 23:50

평범한 필기장

목록AI (50)

평범한 필기장

티스토리툴바