본문으로 건너뛰기

#LLM-as-a-Judge

21개의 포스트

[논문리뷰] Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation

댓글 수 로딩 중

[논문리뷰] Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Specificity-aware reinforcement learning for fine-grained open-world classification

댓글 수 로딩 중

[논문리뷰] Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge

댓글 수 로딩 중

[논문리뷰] Are Today's LLMs Ready to Explain Well-Being Concepts?

댓글 수 로딩 중

[논문리뷰] Learning an Efficient Multi-Turn Dialogue Evaluator from Multiple Judges

댓글 수 로딩 중

[논문리뷰] CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards

댓글 수 로딩 중

[논문리뷰] Unified Reinforcement and Imitation Learning for Vision-Language Models

댓글 수 로딩 중