CVPR 2025 | Papers-with-Code |【合集一】AIGC相关（目前已更20篇，持续更新中）

CVPR2025 | AIGC

是柒号啊

2170人浏览 · 2025-03-13 16:57:52

是柒号啊 · 2025-03-13 16:57:52 发布

在这里插入图片描述
CVPR 2025 decisions are now available on OpenReview！22.1% = 2878 / 13008
会议官网：https://cvpr.thecvf.com/Conferences/2025

目前计划整理六个合集，部分合集未发布
【合集一】AIGC
【合集二】Mamba、MLLM
【合集三】底层视觉
【合集四】检测与分割
【合集五】三维视觉
【合集六】视频理解

欢迎转载，转载注明出处哦——————————————————————————————————————————————————————————————

扩散模型

1.《TinyFusion: Diffusion Transformers Learned Shallow》
paper: https://arxiv.org/abs/2412.01199
code: https://github.com/VainF/TinyFusion
在这里插入图片描述
2.《CleanDIFT: Diffusion Features without Noise》
paper: https://arxiv.org/pdf/2412.03439
code: https://github.com/CompVis/cleandift

3.《CacheQuant: Comprehensively Accelerated Diffusion Models》
paper: https://arxiv.org/pdf/2503.01323
code: https://github.com/BienLuky/CacheQuant
在这里插入图片描述
4.《Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models》
Paper: https://arxiv.org/abs/2501.01423
Code: https://github.com/hustvl/LightningDiT

图像生成

1.《Parallelized Autoregressive Visual Generation》
paper: https://arxiv.org/abs/2412.15119
code: https://github.com/Epiphqny/PAR
在这里插入图片描述
2.《PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation》
paper:https://arxiv.org/abs/2412.03177
code:https://github.com/hqhQAQ/PatchDPO

3.《SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models》
paper: https://arxiv.org/abs/2403.09055
code：https://github.com/ironjr/semantic-draw
在这里插入图片描述
4.《Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient》
paper: https://arxiv.org/pdf/2411.17787
code: https://github.com/czg1225/CoDe

5.《DreamText: High Fidelity Scene Text Synthesis》
paper: https://arxiv.org/abs/2405.14701
code: https://github.com/CodeGoat24/DreamText
在这里插入图片描述
6.《TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation》
paper: https://github.com/ByteFlow-AI/TokenFlow
code: https://arxiv.org/pdf/2412.03069

视频生成

1.《High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model》
paper: https://arxiv.org/abs/2502.19894
code: https://github.com/MingtaoGuo/Relightable-Portrait-Animation
在这里插入图片描述
2.《Identity-Preserving Text-to-Video Generation by Frequency Decomposition》
paper:https://arxiv.org/abs/2411.17440
code: https://github.com/PKU-YuanGroup/ConsisID

3.《WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model》
paper: https://arxiv.org/abs/2411.17459
code: https://github.com/PKU-YuanGroup/WF-VAE
在这里插入图片描述
4.《InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption》
paper: https://arxiv.org/abs/2412.09283
code: https://github.com/NJU-PCALab/InstanceCap

5.《PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation》
paper: https://arxiv.org/abs/2412.00596
code: https://github.com/pittisl/PhyT2V
在这里插入图片描述

图像编辑

1.《CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion》
paper: https://arxiv.org/pdf/2412.01792
code: https://ihe-kaii.github.io/CTRL-D/
在这里插入图片描述
2.《Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing》
paper: https://arxiv.org/abs/2411.16832
code: https://github.com/taco-group/FaceLock

3.《 $h$ -Edit: Effective and Flexible Diffusion-Based Editing via Doob’s $h$ -Transform》
paper: https://arxiv.org/abs/2503.02187
code: https://github.com/nktoan/h-edit
在这里插入图片描述
4.《EmoEdit: Evoking Emotions through Image Manipulation》
paper: https://arxiv.org/pdf/2405.12661
code: https://github.com/JingyuanYY/EmoEdit

5.《Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing》
paper: https://arxiv.org/abs/2411.16832
code: https://github.com/taco-group/FaceLock
在这里插入图片描述

有“AI”的1024 = 2048，欢迎大家加入2048 AI社区

更多推荐

【信创-k8s】海光/兆芯+银河麒麟V10离线部署k8s1.31.8+kubesphere4.1.3

介于V4优秀的LuBan架构，核心组件非常少，资源占用也显著降低，同时带来众多功能和便利性。：使用海光3350/兆芯开先KX-5000芯片，麒麟V10 SP3操作系统，以及Containerd 1.7.13、Kubernetes v1.31.8、KubeSphere v4.1.3等软件版本。原创编写，详细记录了从环境准备到平台验证的完整流程，为信创环境下的Kubernetes与KubeSphere

MCP：从被动响应到自主执行的自动化协议

AI正突破传统代码生成边界，向全流程智能调度演进。MCP（Model Context Protocol）作为开放协议，为AI与工具建立统一接口，实现跨模型、跨工具的复杂流程编排。其核心价值在于生态复用、安全可控和上下文感知，通过客户端-服务器架构让AI自主调用API完成从代码检查到性能优化的全流程。相比Function Call的单次调用，MCP支持多工具串联和本地数据处理，将重塑前端开发模式——

cover

CursorWindows环境与账号追踪机制分析

所有评论(0)

查看更多评论

是柒号啊

@weixin_42258721

已为社区贡献6条内容