#Transparency

6개의 포스트

[논문리뷰] Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

본 논문은 현재 AI 평가 생태계가 파편화되어 있어, 모델의 성능 지표를 신뢰하거나 비교하기 어렵다는 점을 해결하고자 합니다. 기존 연구들은 평가의 특정 측면만을 다루거나 정적인 보고서 형태에 머물러 있어, 실제 평가 파이프라인에서 발생하는 데이터들을 체계적으로 통합하지 못합니다.

#Review #AI Evaluation #Reporting Framework #Reproducibility #Transparency #Interpretive Layer #Benchmark Metadata #Rollout Hierarchy

2026년 6월 8일

[논문리뷰] FASH-iCNN: Making Editorial Fashion Identity Inspectable Through Multimodal CNN Probing

본 연구는 현대 패션 AI 시스템이 특정 패션 하우스나 에디터의 심미적 논리를 데이터 내에 내재화하면서도, 이를 사용자에게 투명하게 공개하지 않는 불투명성 문제를 해결하고자 합니다.

#Review #Fashion AI #Multimodal CNN #Visual Channel Probing #Editorial Identity Encoding #Hierarchical Color Prediction #Transparency

2026년 4월 29일

[논문리뷰] daVinci-LLM:Towards the Science of Pretraining

현재 LLM 생태계는 상업적 모델의 폐쇄성과 학계 모델의 컴퓨팅 자원 부족이라는 구조적 역설(Structural Paradox)에 직면해 있습니다.

#Review #Pretraining #Data Darwinism #LLM #Transparency #Data Processing #Scaling Laws #Reasoning

2026년 3월 31일

[논문리뷰] Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

AI/ML 논문 심사 과정에서 발생하는 저자 답변(rebuttal) 작성의 어려움을 해결하는 것을 목표로 합니다.

#Review #Multi-Agent Framework #LLM Agents #Peer Review #Rebuttal Generation #Evidence-centric Planning #Transparency #Human-in-the-loop

2026년 1월 21일

[논문리뷰] Blueprints of Trust: AI System Cards for End to End Transparency and Governance

본 논문은 AI 시스템의 개발 및 배포 과정에서 투명성과 책임성을 강화하기 위한 새로운 프레임워크인 Hazard-Aware System Card (HASC) 를 소개합니다.

#Review #AI Governance #Transparency #AI System Card #Hazard-Aware System Card #Data Provenance #AI Safety #AI Risk Management #ISO/IEC 42001

2025년 9월 26일

[논문리뷰] BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

현재 Deep-Research 에이전트 평가 벤치마크(예: BrowseComp)는 라이브 웹 검색 API 에 의존하여 공정성, 재현성 및 투명성 측면에서 중대한 한계를 가집니다.

#Review #Benchmarking #Deep-Research Agents #LLMs #Retrieval #Curated Corpus #Evaluation #Fairness #Transparency #Reproducibility

2025년 8월 12일