본문으로 건너뛰기

secrett2633's blog

카테고리

Python

PEP (650)

AI/ML

Review (3569)

OpenSource

PR Analysis (765)
vLLM (71)
SGLang (130)
llm-compressor (45)

Python

PEP (650)

AI/ML

Review (3569)

OpenSource

PR Analysis (765)
vLLM (71)
SGLang (130)
llm-compressor (45)

홈
#Pluralistic Alignment

#Pluralistic Alignment

2개의 포스트

[논문리뷰] Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization

arXiv에 게시된 'Personalized RewardBench: Evaluating Reward Models with Human Aligned Personalization' 논문에 대한 자세한 리뷰입니다.

#Review #Personalized RewardBench #Reward Modeling #Pluralistic Alignment #User Profile #Downstream Validation #Best-of-N #PPO

2026년 4월 8일댓글 수 로딩 중

[논문리뷰] Language of Thought Shapes Output Diversity in Large Language Models

arXiv에 게시된 'Language of Thought Shapes Output Diversity in Large Language Models' 논문에 대한 자세한 리뷰입니다.

#Review #Large Language Models #Output Diversity #Multilingual Reasoning #Language of Thought #Sampling Strategies #Pluralistic Alignment #Hidden State Analysis #Cognitive Science

2026년 1월 18일댓글 수 로딩 중

AI Review Python PEP PR Analysis RSS GitHub

© 2026 secrett2633. All rights reserved.