본문으로 건너뛰기

#Self-Correction

22개의 포스트

[논문리뷰] DMax: Aggressive Parallel Decoding for dLLMs

댓글 수 로딩 중

[논문리뷰] Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines

댓글 수 로딩 중

[논문리뷰] DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation

댓글 수 로딩 중

[논문리뷰] OCR-Agent: Agentic OCR with Capability and Memory Reflection

댓글 수 로딩 중

[논문리뷰] UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

댓글 수 로딩 중

[논문리뷰] Distilling Feedback into Memory-as-a-Tool

댓글 수 로딩 중

[논문리뷰] ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

댓글 수 로딩 중

[논문리뷰] VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

댓글 수 로딩 중

[논문리뷰] RiddleBench: A New Generative Reasoning Benchmark for LLMs

댓글 수 로딩 중

[논문리뷰] Interleaving Reasoning for Better Text-to-Image Generation

댓글 수 로딩 중

[논문리뷰] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

댓글 수 로딩 중

[논문리뷰] Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling

댓글 수 로딩 중

[논문리뷰] Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

댓글 수 로딩 중

[논문리뷰] DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

댓글 수 로딩 중

[논문리뷰] From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

댓글 수 로딩 중

[논문리뷰] PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold

댓글 수 로딩 중

[논문리뷰] Test-Time Policy Adaptation for Enhanced Multi-Turn Interactions with LLMs

댓글 수 로딩 중